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Preface 


The problems in The Econometrics of Financial Markets have been tested in PhD courses 
at Harvard, MIT, Princeton, and Wharton over a number of years. We are grateful to the 
students in these courses who served as guinea pigs for early versions of these problems, 
and to our teaching assistants who helped to prepare versions of the solutions. We also 
thank Leonid Kogan for assistance with some of the more challenging problems in Chapter 
9. 


PREFACE 


Problems in Chapter 2 


Solution 2.1 


2.1.1 Recall the martingale property given by (2.1.2) and observe that the mean-squared 
error of the time-t forecast X, of price Pi441 is 


(82.1.1) E[((X: — Pr) lPe,-..] = (X«— Po + ELPA — Pé |Ps,...] - 
This expression is minimized by the forecast X; = Pi. 
2.1.2 Let | > 0. Then 
E[(P — Pı-r)(Pi-r-ı — Pı-a-ı)]) = El E[Pe — Pj | Pig] 
(82.1.2) (Pira Pı-a-ı) | 
= E[0(P,-4-; — Pi=2r-1)] =0 . 


Solution 2.2 
Denote the martingale property (2.1.2) by M. Then 
(82.2.1) RW1 => RW2 > M > RW3, 


and no other implication holds in general. For example, consider the following counter- 
examples. Let (€. 721 be a sequence of random variables drawn independently from a 
uniform distribution over the interval [—1, 1] and £y = 0. Then the process with increments 
(i) eon —1 = En and ezn = |€,| — 1/2 satisfies RW3 but not M; (ii) en = En&n-ı satisfies M 
but not RW2; (iii) en = n£n satisfies RW2 but not RWI. 


Solution 2.3 


A necessary condition for the log-price process p: in (2.2.9) to satisfy RW1 is a+8 = 1. Let 
c =a + and consider the set of all non-RW1 Markov processes (2.2.9), i.e., c 4 1. The 
restriction CJ = 1 is equivalent to aß = c/4. The constraints 0 < a, < 1 are satisfied 
exactly for c € [1, 4/3] and therefore the set of all two-state Markov chains represented by 
the pair (a, 8) that cannot support any RW1 process but still yields CJ = 1 is simply 


(82.3.1) {1+ V1 — 671,1 VA- ec1)e/2;1 < c € 4/3}. 


Such Markov chains do generate sequences, reversals, etc. 


Solution 2.4 


For a stationary process, Var[Z;] = Var[Z;_,] and Cov[Zi-4, Zi-i] = Cov[Z:, Zi- 1] 
Thus, we have 

q—1 q—1 
(82.4.1) Var[Z((q) = Y; Var[Zi-1] + 2) (q - k)Cov[Zs, Zi] 

k=0 k=1 


which yields (2.4.19). The coefficients of Cov[Zi, Zi-.,] are simply the number of k-th 
order autocovariance terms in the variance of the multiperiod return Z;(q) (recall that this 
multiperiod return is the sum of q one-period returns). The coefficients decline linearly 
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Ten individual stocks used for problem 2.5, identified by CRSP permanent number 
PERMNO, CUSIP identifier, (most recent) ticker symbol and abbreviation of full name. 


PERMNO CUSIP Ticker COMPANY NAME 


18075 03203710 AP AMPCO-PITTSBURGH CORP. 
30840 21161520 CUO CONTINENTAL MATERIALS CORP. 
26470 29265N10 EGN ENERGEN CORP. 

32096 36480210 GAN GARAN INC. 


19174 37006410 GH GENERAL HOST CORP. 

12095 37083810 GSX GENERAL SIGNAL CORP. 

15747 45870210 IK INTERLAKE CORP. 

12490 45920010 IBM INTERNATIONAL BUSINESS MACHS. CORP. 


18286 75510310 RAY RAYTECH CORP. DE 
15472 98252610  WWY WRIGLEY, WILLIAM JR. Co. 


'TABLE 2.1. Ten individual stocks for Problem 2.5 


Periods Po to P4 for daily and monthly data. 


Daily Periods Monthly Periods 
Period Calendar Days Length (Days) Calendar Days Length (Months) 


A 620703-941230 8179 620731-941230 390 
Aı 620703-700923 2045 620731-700831 98 
As 700924-781027 2045 700930-780929 97 
Az 781030-861128 2044 781031-861031 97 
Aq 861201-941230 2045 861128-941230 98 


TABLE 2.2. Periods for Daily and Monthly Data 


with k until they reach zero for k = q because there are successively fewer and fewer 
higher-order autocovariances. 

From (2.4.19) it is apparent that individual autocorrelation coefficients can be non- 
zero but their weighted average can be zero. For example, according to (2.4.19), VR(3) = 
1+2(3p1 + 3p2), hence a non-random-walk process with pı = —4 and p» = 5 will satisfy 
VR(3) = 1. Therefore, the variance ratio test will have very lower power against such 
alternatives, despite the fact that they violate the random walk hypothesis. 


Solution 2.5 


We consider the daily and monthly returns of the ten individual stocks considered in 
Chapter 1 (see Table 1.1). We use CRSP daily data consisting of 8,179 days from July 3, 
1962 to December 30, 1994 and CRSP monthly data consisting of 390 months from July 
31, 1962 to December 30, 1994. For these ten stocks there are 23 missing daily returns 
and 4 missing monthly returns in our sample. The stocks are identified in Table 2.1, and 
we shall refer to them by their ticker symbols (value-weighted and equal-weighted indexes 
will be denoted by VW and EW). 

Denote the entire sample period by A and the four consecutive subperiods of approx- 
imately equal length by Ai, A», As, A4, respectively (note that these periods differ for 
daily and monthly data). Descriptions of lengths and starting and ending dates of the 
periods are given in Table 2.2. 
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Statistics for daily and monthly simple and continuously compounded re- 
turns. All the statistics fi, 6, and f(1) are reported in percent. 


Security Simple Returns Cont. Comp. Returns 
Daily Sampling Monthly Sampling Daily Sampling Monthly Sampling 

Period Ê $ Ba) Ê $ 1209) Ê $ Ba) Ê ĉ Ba) 
vu A 0.044 0.803 19.4 0.96 4.37 4.8 0.041 0.807 19.3 0.85 4.38 5.8 
A1 0.035 0.635 25.9 0.76 3.82 6.2 0.033 0.635 25.9 0.68 3.82 ren 

A2 0.026 — 0.814 29.4 0.71 4.65 5.8 0.022 0.813 29.4 0.60 4.60 6.6 

A3 0.070 0.821 16.0 1.40 4.58 -4.7 . 0.067 0.822 16.0 1.29 4.55 -4.7 

Az 0.045 — 0.913 11.0 0.95 4.36 9.8 0.040 0.822 114 0.85 4.49 E 

EW A 0.078 0.685 38.6 1.25 5.67 22.0 0.075 0.687 38.7 1.08 5.67 22.2 
A1 0.069 0.728 38.8 1.18 5.44 18.9 0.066 — 0.728 38.8 1.03 5.44 — 20.8 

Az 0.060 0.696 49.5 1.37 6.61 19.6 0.057 0.696 49.5 1.16 6.39 21.2 

Az 0.082 0.644 33.1 1.54 5.51 16.4 0.079 0.646 33.1 1.38 5.55 15.0 

Az 0.100 0.669 30.4 0.91 5.00 33.6 0.097 0.676 30.9 0.77 5.22 31.7 

AP A 0.053 2.411 -3.7 1.06 10.62 0.3 0.024 2.396 -4.1 0.52 10.31 0.2 
A1 0.076 2.983 -6.2 1.88 12.14 1.9 0.032 2.952 -6.9 0.68 11.58 2.4 

Az 0.070 2.313 -6.5 1.54 9.15 -7.1 0.044 2.295 -6.9 1.13 8.82 -6.6 

A3 0.042 1.701 7.5 0.94 9.50 3.9 0.028 1.694 7.4 0.50 9.34 3.7 

Az 0.024 2.472 -2.8 0.45 11.37 -0.9 -0.007 2.470 -3.2 -0.18 11.19 -2.1 

cuo A 0.143 5.239 -20.9 1.65 17.76 -7.0 0.009 5.155  -21.8 0.19 16.96 -9.7 
A1 0.241 6.722 -26.7 2.02 18.92 -1.6 0.022 6.590  -28.5 0.46 17.14 -5.1 

Az 0.191 6.699  -29.2 1.39 19.60  -11.1  -0.027 6.577  -30.2 -0.25 17.52 -12.4 

Az 0.140 3.523 9.5 3.11 18.47 -3.5 0.079 — 8.497 8.4 1.48 18.36 -7.9 

A4  -0.000 2.692 13.8 0.09 13.22  -14.1 -0.038 2.714 15.2 -0.88 14.49  -12.8 

EGN A 0.054 1.407 -6.8 1.09 5.75 -7.0 0.044 1.405 -6.9 0.94 5.55 -6.8 
A1 0.022 1.083 -8.8 0.43 3.65 -14.9 0.051 1.080 -8.3 0.36 3.60 -14.7 

Az 0.047 1.636 -12.4 0.97 6.80 3.2 0.034 1.637  -12.5 0.76 6.19 4.4 

A3 0.091 1.437 -8.6 1.79 5.45 -12.9 0.081 1.434 -8.8 1.63 5.81 -13.2 

Az 0.056 — 1.415 2.8 1.20 6.51  -14.0 0.046 1.411 2.8 0.99 14.49 -12.4 

GN A 0.079 2.349 4.4 1.65 11.30 2.8 0.051 2.333 4.1 1.03 10.92 4.2 
A1 0.088 2.886 8.2 1.76 14.12 12.0 0.047 2.852 7.8 0.84 13.34 14.3 

Az 0.085 2.729 -0.7 1.95 11.71 -5.7 0.047 2.728 21,2 1.29 11.18 -5.1 

Az 0.106 — 1.918 -0.8 1.95 8.64 -2.5 0.088 1.910 -0.2 1.56 8.61 -1.0 

Az 0.036 — 1.614 14.8 0.93 9.94 4.9 0.023 1.601 15.1 0.44 9.94 5.5 

GH A 0.070 2.790 -2.2 1.88 11.65 6.3 0.032 2.768 -2.4 0.66 11.58 5.7 
A1 0.069 3.103 -6.0 1.06 11.91 3.4 0.022 8.074 -6.2 0.85 12.00 3.6 

Az 0.060 2.936 4.3 1.80 12.68 19.1 0.018 2.890 3.7 0.55 12.05 17.6 

A3 0.060 2.389 -1.0 3.27 10.94 12.5 0.126 — 2.373 “19 2.64 10.94 -12.0 

A4  -0.000 2.677 -6.0 -0.29 10.69 5.3 -0.037 2.682 -5.6 -0.87 10.78 5.4 

asx A 0.054 1.660 11.6 1:12 8.18 2.7 0.040 1.661 11.7 0.83 8.21 3.7 
A1 0.063 1.866 TA 1.45 9.04 1.6 0.046 — 1.862 7.3 0.89 8.82 2.1 

A2 0.055 1.710 19.7 1.37 8.96 6.6 0.041 1.710 19.8 1.05 8.85 7.8 

A3 0.042 1.600 6.5 0.92 6.75 -7.2 0.042 1.599 6-5 0.69 6.67 -5.9 

AA 0.042 1.436 13.4 1.03 7.74 3.6 0.031 1.443 14.1 0.70 8.30 4.9 

IK A 0.043 — 2.156 0.4 0.86 9.37 -6.5 0.020 2.145 0.3 0.43 9.22 -5.0 
A1 0.031 1.395 -0.7 0.69 6.42 -15.1 0.022 1.891 -0.8 0.49 6.81  -14.0 

A3 0.064 1.475 6.0 0.71 7.19 oe 0.040 1.470 5.8 1.12 6.79 -3.9 

A3 0.102 1.441 8.6 2.12 4.58 -6.4 0.041 1.431 8.5 1.85 7.03 -6.7 

A4  -0.025 3.518 -1.8 -0.73 14.18 -8.4 0.031 3.498 -2.1 -1.73 14.02 -7.2 

IBN A 0.039 1.423 -0.4 0.81 6.17 6-6 0.029 1.427 -0.4 0.61 6.19 6.9 
A1 0.068 1.257 6.2 1.89 5.62 6.9 0.060 — 1.255 6.2 1.22 5.56 7.5 

Az 0.028 1.355 3.8 0.66 5.97 1.6 0.019 1.351 3.8 0.48 5.90 1.4 

Az 0.058 1.375 -6.7 1.10 5.46 44 0.048 1.370 -6.7 0.95 5.37 4.5 

Az 0.002 1.670 -2.8 0.07 7.35 5.1 -0.012 1.690 -2.8 -0.20 7.57 5.1 

RAY A 0.050 3.388 -0.6 0.83 14.88 -12.0 -0.008 3.362 -1.4 -0.13 13.65 -11.7 
A1 0.014 1.426 10.3 0.32 6.63 15.2 0.004 — 1.449 10.0 0.08 6.97 18.0 

A2 0.062 1.914 12.1 1.53 8.59 -9.9 0.043 1.904 12.1 LAT 8.47 -8.7 

Az  -0.014 3.051 8.6 -0.43 15.28 -20.7 -0.060 3.027 7.9 -1.57 15.00 -18.1 

Az 0.137 5.558 -5.6 1.88 22.98 -11.1  -0.014 5.505 -6.6 -0.23 19.83  -12.5 

wy A 0.072 1.446 5.6 1.51 6.67 2.8 0.061 1.447 5.6 1.29 6.55 2.0 
A1 0.026 0.864 5.2 0.56 3.61 -8.8 0.022 0.862 5.2 0.50 3.58 -3.2 

A3 0.036 — 1.355 12.4 0.90 7.47 17.6 0.027 1.361 12.3 0.62 7.40 15.7 

A3 0.110 1.510 7.2 2.20 6.21 -10.0 0.099 1.504 7.1 1.99 6.10 -10.3 

Az 0.116 1.868 0.5 2.49 8.25 -5.4 0.098 1.873 0.7 2.05 8.08 -5.6 


TABLE 2.3. Statistics for Daily and Monthly Simple and Continuously 
Compounded Returns 


2.5.1 See the left side of Table 2.3 for the required statistics. 
2.5.2 See the right side of Table 2.3. Tf r denotes net simple return in percent, then 
100 x log(1 + r/100) is the corresponding continuously compounded return in percent. 
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2.5.3 See Figure 2.1 for the required plots. Returns are truncated to fit in the interval 
[-3%, 396], i.e., returns smaller than —3% are replaced by a return of —3%, and returns 
larger than 3% are replaced by a return of 3%. 
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FIGURE 2.1. Histograms of returns on indexes; 1962-1994 
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2.5.4 Use results from Problems 2.5.1 and 2.5.2 (i.e., Table 2.3), counts from Table 2.2, the 
assumption that returns are IID, and the asymptotic normality of fi to obtain estimates 
of the 9996 confidence intervals. 

2.5.5 Compute the statistics of interest as in Problem 2.5.1. See also Table 1.1 in the 
text for the statistics for the entire sample period. The variances of your estimates can be 
estimated via the bootstrap (see Efron and Tibshirani [1993]) under the assumption that 
returns are temporally IID. Computing the exact variances for estimators of skewness, 
kurtosis, and the studentized range is possible under certain distributional assumptions 
for returns, but is quite involved so the bootstrap is the preferred method—it simplifies 
the estimation greatly: no additional sampling theory is needed). Use the asymptotic 
normality of your estimators to perform the tests. 


Problems in Chapter 3 


Solution 3.1 
Using (3.1.4), we obtain 


(83.1.1) Ep) = XO EIXa(K)]E[ri c] = Ya-m)am = m 
k=0 k=0 
as in (3.1.9). Observe that 
El) = So Elak) X (DEI terit] 
k.1=0 


£ > E[Xi (k)]E[r?.—..] t 
k=0 


(S3.1.2) 2 5 5 E[X à (k)Xu(DElrie-r]Elri,t-ı] 
k=0 1>k 
= of + 2m y Y main" 
k=0 1>k 


= 0424/10), 


hence 
(83.1.3) Varr] = El(r%)?] — = 0? + 2mp? /(1 — mi) 
as in (3.1.10). Next, for n > 0 we have 
Elráriia] = DY BEDXaQ)XG- (D]E[ri -kri t-n] 
k=0 I=0 
n-1 oo 
(83.1.4) = px — mia? (1 — mi) 
k=0 [=0 
which yields the first part of (3.1.11). For 4 Z j and n > 0 we have 
E[írjc4 = DD, EXEL- D] 
k=0 1=0 
(83.1.5) = mm + —m)ni " (1 — 1), Bij OF 
1=0 
a er REL 


and the second part of (3.1.11) follows. Equation (3.1.12) is direct consequence of (3.1.10) 
and the first part of (3.1.11). 
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Solution 3.2 
Consider the case where the common factor f; is the (observable) market portfolio. Then 
the true beta of security i is 8; as in (3.1.1) and the beta computed from observed returns 
B? is given by 
Be Covlris, f«]/ Varl[fi] 
Elfe S 5 Xa(k)ria-i]/03 
k=0 


= (L= ri) bi 


(53.2.1) 


Thus, the beta will be biased towards 0 if nonsynchronous trading is not properly ac- 
counted for. 


Solution 3.3 


3.3.1 Let Pj and Qi; denote the unconditional probabilities that 0j, = 0 and 6% = 1, 
respectively and let P; and Q; be the corresponding steady-state probabilities. Then 
(3.5.1) yields 


(83.3.1) Pa = giPua-ict-c)Qu- 


and similarly for Z. In steady state, P; = Pi: = Pi ¿-1 and Qi = Qi: = Qit-1, hence 


lr 
83.3.2 Be i 
( ) 2 — (ri + m1) 
1—7i 
Qi = 2 — (mi + T) 


Therefore, the unconditional steady-state mean, variance, and first-order autocorrelation 
of dir is 


(83.3.3) ps = Qi 
(53.3.4) 03; Elsi] — Elia” = Qil- Qi) 
(S3.3.5) ysii(1) E[óuói4-i1]— uà; = Qi(mi- QJ. 


3.3.2 To calculate the statistics of observed returns, use (3.1.4). For the mean we have 


(83.3.6) pri = DEAG A 
k=0 


xdi (so «neta m) = u(niQi * Pi) , 


k=1 


for the variance 


2 
oki = E 


(Arme) | — pri 
k=0 


2 (#1Qs+ Y ElXu Qnax(, )] | — 12 
k,l=1 


(S3.3.7) 


2 P; 1 1 2 
= è (— +Q- (QRP), 
ki (nos + TiQ: — (riQi + Pi) 
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and for the first-order autocovariance 


yi) = E Y Xul) Xi Ri Rió — HRi 
k,I=0 
(S3.3.8) = Y mnuE[XG-a (0)] — wi 
1=0 


= m (mi (1Q;+Po)?) . 

Thus, serial correlation in 6; decreases the mean uri as compared to the case of no 
nonsynchronous-trading effects, ceteris paribus, since 1;Q; + P; < 1 for 0 < Ti, T; < 1. 

3.3.3 Assume we are given a sequence (0i. of no-trade indicators. For convenience, 
we shall condition on the initial no-trade indicator do (the extension to the general 
case is straightforward). Denote by nioo, nioi, niio, and nioo the counts of all pairs of 
consecutive days with no-trade patterns ‘00’,‘01’,‘10’, and ‘11’, respectively. Therefore, 
3 joo, Risk = T. Since d+ follows the Markov process (3.5.1), the log-likelihood function 


of the sequence {ði} is 
(S3.3.9) £ C = nioolog Ti + nioi log(1 — 14) + 
Ni10 log(1 — Ti) +711 log T: A 


The maximum likelihood estimators of 7;,7; are 


(S3.3.10) pa, ge 
Nioo + Nior 
n Nili 
m = —. 
Ni10 + Nii 
and the Fisher information matrix is 
à 1 Tige + Gane 0 
(S3.3.11) ilti; mi) = E * p un n 


so that our estimates 7;,7; are asymptotically independent and normal, with asymptotic 
variances estimated efficiently by 


(83.3.12) 85, = (W(1— fi)(nioo + nio1)) 7" 
ôb, = (i — Âi) (niao + nai)) 


The results of the empirical analysis are given in Tables 3.1 and 3.2. In Table 3.1, the non- 
trading counts are reported for six securities using five years of daily data from January 4, 
1988 to December 31, 1992. The data was extracted from the CRSP daily master file: out 
of 1,120 ordinary common shares continuously listed on the NYSE over this time span, 
360 did not trade at least on one of the NYSE trading dates, and 56 did not trade on 
at least 100 days out of 1,517 days in total. Our sample is a randomized selection of six 
stocks from the latter set. Values for n;o and n;ı, defined analogously to nioo etc., are 
also provided for convenience. Note that n;o1 and n;ıo coincide in some cases. 
Estimates of m; and 7; are given in Table 3.2. 


Solution 3.4 


Let the Markov process for I; be given by the transition probability matrix 


1-p 
83.4.1 CIT tae y 
(53.4.1) Ca 


with steady-state probabilities of I; being —1 and 1 given by P = (1 — q)/(2 — p — q) and 
Q = (1— p)/(2 — p — q), respectively. 
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Input Data for Problem 3.1.3. Representative sample from infrequently traded (at 
least 100 no-trade days in the sample interval) ordinary common shares continuously 
listed on the NYSE from 1988 to 1992. Each stock is identified by its ticker symbol and 
CUSIP number. Counts of days a stock did not trade njo, did trade n;ı, and patterns 
of non-trading for all pairs of consecutive days nioo, Niol, niio, ni11 are reported. 


Ticker CUSIP nio Nil "ioo Tio Nio "ni 


ZMX 98991710 390 1127 129 260 261 867 
UNF 90470810 244 1273 67 177 177 1096 
JII 47936810 220 1297 70 150 150 1147 
MBC 59478010 173 1344 25 148 148 1196 
ADU 02342610 136 1381 33 103 103 1278 
LVI 50243910 117 1400 35 82 82 1318 


TABLE 3.1. Input Data for Problem 3.1.3 


Parameter Estimates for Problem 3.1.3. Maximum likelihood estimates of proba- 
bilities 7;, m} for representative sample of infrequently traded stock are reported, together 
with estimates of their standard deviations o. 
Ticker Îi Gr, Ly si 

ZMX 0.327 0.108 0.769 0.071 

UNF 0.275 0.143 0.861 0.081 

JII 0.318 0.145 0.884 0.087 

MBC 0.145 0.216 0.890 0.087 

ADU 0.243 0.200 0.925 0.102 

LVI 0.299 0.202 0.941 0.114 


TABLE 3.2. Resulting Statistics for Problem 3.1.3 


Then AP; is a four-state Markov process where the quasi-state AP; = 0 is in fact two 
distinct states according to whether the pair (7,1, I4) is (—1, —1) or (1,1). In the steady 
state, we have the following transition probability matrix: 


0 p 1-p 
(1—p)g(1—4) P2a-a+r?a-») p(1—-p)(1—4) 
(83.4.2) p»(1—a)-a(1—p) p(1—-a)da(1—p) p(1—a)-ca(1—p) 
l-q q 


The moments of AP; are then 


E[A P; = 0, 
2 e en 
(83.4.3) Yard ci oes a). 
2-p-q 
: = = 
CHARLA ua ¿CEA 
2-p-q 
(1,)0 (1 =p» =1), x20, 
et e k—1 = " i 
CHP AB Ze DO in gg, 


2(2 — p — q) 


Observe that the first autocorrelation coefficient equals —1/2 as in the IID case, but the 
higher-order autocorrelations are nonzero in general. 
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Solution 3.5 


We will show how discreteness can influence and bias several popular stock price statis- 
tics. Consider a stock with a virtual price process that follows a continuous geometric 
Brownian motion, with a net expected annual return and standard deviation of return 
(not continuously compounded) of u = 10% and o = 20%, respectively. 

Assume the observer has available daily-sampled prices rounded to the closest eighth 
of a dollar (or to $0.125 if the virtual price is less than $0.125) for a period of ten years. 
For purposes of this exercise we neglect the complications of non-trading days and assume 
the year consists of 253 equally-spaced trading days. 

We shall focus on the estimator ji; of the expected annual returns defined as rescaled 
arithmetic average of daily returns, and the estimator ô of the volatility of annual returns 
defined as a rescaled standard deviation of daily returns. The rescaling is as follows: the 
average of daily returns is multiplied by the number of trading days 253, and the standard 
deviation of daily returns is multiplied by V253. While such estimators might be suitable 
for slowly-changing and continuous price processes, they are badly biased estimators of 
the theoretical 1096 expected return and 2096 standard deviation, respectively. 

Expressing the parameters of the underlying geometric Brownian motion process as 


ams 
(S3.5.1) p = log EINER 
(83.5.2) o = log(l+o/(u+1)) 


and running 4,000 replications of the simulation described above, we report the means of 
the statistics for a hypothetical stock with various initial prices in the Table 3.3. 

Estimates of return, standard deviation, and autocorrelation are highly biased for 
low-priced stocks. Indeed, the hypothetical $0.25 stock exhibits apparent return of almost 
50%. For higher stock prices the discreteness biases subside. Nevertheless, we see that 
even for high-priced stocks the estimates are still biased due to the way we rescaled daily 
estimates to yield annual figures (these estimates would be unbiased if we had assumed 
arithmetic instead of geometric Brownian motion). 

Problem 3.5 shows that the effects of price discreteness can be substantial for stock- 
return statistics and that appropriate care has to be taken to avoid such biases. 


Solution 3.6 


3.6.1 From the histogram of IBM transaction stock prices on January 4th and 5th, 1988 
(Figure 3.1) we observe price clustering around $120 and $123. These clusters correspond 
to trades taking place on different days. 

On the other hand, the histogram of price changes (Figure 3.2) does not exhibit any 
apparent clustering, leaving aside the discretization to eighths of dollars (or “ticks”), i.e., 
the smallest price variation possible from one trade to the next. We see that most of 
changes fall in the range from —2 to +2 ticks. 

When we compare the two histograms of price changes conditional on prices falling 
on an odd or an even eighth (Figure 3.3), we see a different pattern: there are fewer 
zero-tick price changes that fall on odd eighths than on even eighths, and relatively more 
one-tick price changes that fall on odd eighths than on even eighths. Overall, even-eighth 
prices are significantly more frequent then odd-eighth ones. These regularities underscore 
the potentially important impact that discreteness can have on statistical inference for 
transactions data. 

3.6.2 The histogram of times between trades for IBM stock (Figure 3.4) shows that the 
majority of trades take place within intervals shorter than one minute. Based on n = 2,746 
time intervals, the estimate of the expected time between trades is ha = 16.86 and the 
estimate of the standard deviation of the time between trades is Ga = 19.46. The 95% 
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Histogram for IBM's Stock Price 
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FIGURE 3.1. Histogram for IBM Stock Price 
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FIGURE 3.2. Histogram of IBM Price Changes 
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Histogram of IBM's Stock Price Changes Falling on Even Eighth 
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FIGURE 3.4. Histogram of Times Between Trades for IBM 
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Simulations Results for Problem 3.5. The impact of discretization of prices to 
a $1/8 grid on naive estimates of annual mean and standard deviation based on daily 
returns data is simulated for a hypothetical stock following a continuous time geometric 
Brownian motion price process with an annual expected return of 10% and an annual 
standard deviation of 20%. For a low-priced stock discreteness biases are substantial. 
For a high-priced stock the main source of bias is the misspecification of the process for 
purposes of estimation, e.g. taking arithmetic means instead of geometric. The statistics 
are based on 4,000 replications of 10 years of daily price data for each row. 


Initial price Expected Return Standard Deviation Autocorrelation 


0.25 0.4949 0.9170 -0.2609 
(0.0043) (0.0049) (0.0010) 
0.50 0.2893 0.6454 -0.2821 
(0.0012) (0.0021) (0.0007) 
1.00 0.1847 0.4537 -0.2886 
(0.0006) (0.0012) (0.0005) 
2.00 0.1327 0.3236 -0.2734 
(0.0007) (0.0008) (0.0005) 
5.00 0.1038 0.2210 -0.1562 
(0.0008) (0.0003) (0.0009) 
10.00 0.0977 0.1918 -0.0562 
(0.0009) (0.0001) (0.0006) 
20.00 0.0963 0.1834 -0.0167 
i (0.0009) (0.0000) (0.0003) 
50.00 0.0969 0.1808 -0.0033 
i (0.0009) (0.0000) (0.0003) 
0.0934 0.1805 -0.0010 

100. 
00:00 (0.0009) (0.0000) (0.0003) 


TABLE 3.3. Simulation Results for Problem 3.5 


confidence interval for the expected time can be therefore estimated as 
(53.6.1) (ha — 1.9664 /n'?, ha + 1.9664 /n ^) = (16.23, 17.49) . 


Suppose that trade times follow a Poisson process with parameter A. That is, assume 
that the probability Pr of exactly k trades occurring during any one-minute interval is 
given by 


(S3.6.2) PR=e’-. 


The sample average time between trades fia is a sufficient statistic for A; in fact, A= 60 / fia 
is a consistent and efficient estimator of A. Note that the number 60 is the result of 
rescaling time from seconds to minutes. For our sample, À — 3.56. We can map the 9596 
confidence interval (83.6.1) of pa derived above into the following 95% confidence interval 
for A 


60 60 
.6. E cs 3.69) . 
(up) (i ici) ee) 
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IBM's Stock Price Path, Jan 4, 1988 
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FIGURE 3.5. IBM Price and Volume on Jan 4, 1988 


Note that this confidence interval is not centered on A, but has the advantage of following so 
directly from the confidence interval for ha. As n — oo, both fia and À are asymptotically 
normal consistent estimates of pa and A. 

It also follows from the definition of the Poisson distribution that the probability of 
no trade during a one-minute interval can be estimated by 


(S3.6.4) Po =e = 0.0285 


with a 95% confidence interval of (0.0248, 0.0324). 

Dividing the two trading dates into one-minute intervals and counting the number of 
trades, we get a total of 776 minutes (excluding possible opening and closing lags each day). 
A trade occurred in 733 of them. Furthermore, 697 minutes in which a trade occurred 
were immediately preceded by a minute in which a trade occurred as well. Therefore, the 
estimate of the probability of a trade occurring within a particular minute is 0.0554 with a 
95% confidence interval of (0.0393, 0.0715), and the estimate of the probability of a trade 
occurring within a particular minute conditional on a trade occurring in previous minute 
is 0.0491 with a 95% confidence interval of (0.0335, 0.0648). Estimates of conditional 
and unconditional probabilities do not differ statistically significantly, hence we cannot 
reject the hypothesis of independence on these grounds. On the other hand, there is a 
statistically significant discrepancy between these sample probabilities and the estimate 
based on the Poisson assumption. Thus, we can reject the independence of trades in that 
sense. 

3.6.3 Plots of price and volume against time-of-day for both days exhibit certain patterns 
(Figures 3.5, and 3.6). Price discreteness is visible from its price path; volume exhibits 
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IBM’s Stock Price Path, Jan 5, 1988 
123.5 T T T T T 


123r 


er 

N 

N 

a 
T 


Price in Dollars 


122r 


121.5 L L 1 L L L L 
9 10 11 12 13 14 15 16 17 


Day Time in Hours 


IBM's Stock Trading Volume, Jan 5, 1988 
250 T T T T T T T 


E 
a 
o 


Volume in 100s of Shares 


o 


E OCA 15 16 17 
Day Time in Hours 


o 
af 
E 
A 
= 


FIGURE 3.6. IBM Price and Volume on Jan 5, 1988 


large skewness and kurtosis; there is apparently less volume around lunchtime. Time-of- 
day phenomena are probably untestable from a sample of two days. There is no apparent 
relationship between price movements and volume visible by naked eye. 

Consider the simple qualitative hypothesis that large-volume trades are accompanied 
by price movements of different magnitude than small-volume trades. Let us partition 
sample of 2,746 trades into ny = 42 block trades (trades that are greater than or equal to 
100 round lots) and n, = 2,704 smaller trades, and compute the sample means bs, ôs and 
standard errors of absolute price changes immediately following the trades, expressed in 
dollars: 


(53.6.5) = = 0.0446 (0.0102) 


= 0.0675 (0.0019) . 


The difference of these averages is 0.0229 with a standard error of 0.0104, which is signif- 
icantly different from zero at the 596 level. Therefore, trading volume is indeed linked to 
subsequent price changes. Note that block trades are followed by smaller price changes 
than the majority of small volume trades. 

3.6.4 Consider the following simple model for estimating the price impact of selling IBM 
stock. Assume that we cannot distinguish whether a trade was “seller-initiated” or “buyer- 
initiated" from the data, so that we will relate only the absolute magnitude of trading 
volume to the absolute magnitude of price change as in the previous part. Moreover, as- 
sume that the (absolute) price impact of a trade is proportional to volume, ceteris paribus, 
and that errors of measurement are, after division by volume, independent and identically 
distributed. Under these strong but simple conditions we can estimate efficiently the co- 
efficient of proportionality p between volume and its price impact as the sample mean f 
of ratios of absolute price changes to volume, according to the Gauss-Markov theorem. 
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Histogram of IBM's Bid/Ask Spread on Jan 4, 1988 
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FIGURE 3.7. IBM Bid/Ask Spread Histogram on Jan 5, 1988 


In our data f = 0.0310 (in dollars per one round lot) with a standard error of 0.0013. 
Thus, we can conclude that the seller of one round lot effectively pays three cents per 
share less than marginal seller. This amount becomes economically interesting in the case 
of block trades where the loss is of the order of $3 per $120-share of stock. 


Solution 3.7 


3.7.1 The structure of individual bid/ask spread data is particularly simple. During 
January 4th and 5th, 1988, only four sizes of spreads occurred among 1,327 quotes. There 
were 748 quotes with a spread of one tick, 502 with two ticks, 65 with three ticks, and 11 
with four ticks. A histogram (Figure 3.7) shows that one-tick and two-tick spreads were 
by far the most common. 

Bid-ask spread dynamics are not IID. Table 3.4 displays the empirical distribution of 

bid/ask spreads both unconditionally and conditionally on the previous quote’s spread. It 
is apparent that the conditional distributions differ significantly from the unconditional 
distribution. 
3.7.2 The question of “causality” between quote revisions and transactions is difficult to 
answer with the data at hand if we wish to take into account agents’ expectations about 
future events. Thus, for simplicity we shall consider “causality” strictly in the temporal 
sense: does an increase in the spread come before or after an increase in trading volume, 
ceteris paribus? 

First, for simplicity let us measure intensity of transactions activity at any time 
interval by the number of shares traded in that interval, independently of how the volume 
is broken up to individual trades and independently of the stock price. 

Let us partition the trading day to n = 1,... , N, roughly 15-second intervals delim- 
ited by a subset of quotes. Let variables sn, vj! and v; indicate changes in quote spread 
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Unconditional and Conditional Distributions of Bid/Ask Spreads for IBM 
stock during January 4th and 5th, 1988. Relative frequencies of bid/ask spreads condi- 
tional on preceding quote’s spread are expressed in percent. Spreads are denominated 
in ticks. 


Previous Current Spread 
Spread 1 2 3 4 


714 27.1 1.5 0.0 
40.0 51.6 7.2 1.2 
52.3 23.1 6.2 
9.1 54.5 27.3 9.1 


eNe 
= 
d 
ex 


Any 564 37.9 49 08 


TABLE 3.4. Unconditional and Conditional Distributions of Bid/ Ask Spreads 


and transaction volume related to nth interval. More specifically, let s; be UP, if quote 
spread increases between quotes delimiting nth interval, UNCH if spread does not change, 
and DOWN if spread decreases. Also, vi is UP if trading volume at interval n is smaller 
than that in n + 1, etc. Analogously, v; be UP if trading volume in n — 1 is smaller than 
that in n. 

Estimates of joint probabilities of s, v^ and v” allow some statistical inference about 
the relation between spreads and transactions. In particular, if quote revisions affect only 
subsequent transactions but do not influence previous one, the variables s and v^ should 
be statistically independent. On the other hand, if quotes reflect previous transaction 
activity, s and v* should be statistically independent. The empirical distribution of the 
27 triples [v^ , 5, v^], under assumption that their realizations at triples of consecutive 
15-second intervals are IID, allow us to test the proposed hypotheses. 

However, it is well possible that transaction activity and quote revisions influence 
temporally each other. That being the case, testing existence of unilateral causality may be 
next to meaningless. Therefore, let us test “causality” in each direction separately, against 
alternative hypothesis of no relation between quote revisions and transaction activity. In 
another words, let us test whether variables s and v^ are dependent, to see whether current 
quote revisions influence future transactions. Similarly, let us test whether variables s and 
v are dependent, to see whether current quote revision is influenced by past transactions. 

Using a standard asymptotic test of independence for a contingency table as described 
in Rao (1973, pp. 404-412), we have, under the null hypothesis of independence, that: 


T El 2 
(S3.7.1) E E 


as ni¡n.¡/n.. 
has x? distribution with (r — 1)(s — 1) degrees of freedom. In our case r = s = 3 we have 
xi. The contingency tables (Tables 3.5) provide a summary of the data. 

It turns out that x2 statistics for the first table is 13.6, and for the second 17.9 so that 
we reject the hypothesis of no dependence between s and v^ on 0.9% significance level 
and that of s and v* on 1.296 significance level. Thus we have shown that quote revisions 
“influence” future transactions, and past transactions “influence” quote revisions. 

The next step of the analysis may be to postulate a particular model that involves 
both effects between transactions and quotes, and perform another round of the statistical 
analysis. 

3.7.3 This part is very similar to 3.7.2, hence we omit the solution. 
3.7.4 Let us assume that the investor starts with the bond position and that considers 
the quotes as the relevant price information sense: the investors account only for the 
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Contingency Tables for causal relationship between transactions activity and quote 
revisions. The trading day Jan 4, 1988 is divided to quote-to-quote intervals of roughly 
15 seconds apart and changes of bid/ask spread together with changes in the trade 
volume in these intervals are counted. Tables show relationship between past /future 
changes in transactions activity against the spread change in the intervals. 


Past Current Spread Future Current Spread 
Trading UP UNCH DOWN Trading UP UNCH DOWN 


UP 23 59 10 UP 20 51 33 
UNCH 3 12 2 UNCH 2 2 2 
DOWN 21 50 32 DOWN 25 68 9 


TABLE 3.5. Contingency Tables 


rise or decline of the mid-price given as average of the bid/ask spread. Quotes that do 
not change the mid-price are effectively ignored. Further assume that at the end of the 
two-day trading period the stock position is liquidated into bonds. 

A simulation of such a trading strategy shows: (1) if the investor is allowed to buy 
and sell at the average, he is left with $101,899 at the end; (2) if the bid/ask prices are 
used, he is left with $97,769. We see that the bid/ask spread does matter. 

It would be difficult to perform any sensible statistical analysis based on the one 
simulation performed. In particular, it is incorrect to assert that the strategy in (1) that 
led to nearly a 1% return over one day would dominate a buy-and-hold strategy for a 
different data set or over a different time span. Nevertheless, the gap between the profits 
of (1) and (2) are real: frequent trading and large spreads do create significant losses 
compared to the “frictionless” case. 
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Problems in Chapter 4 


Solution 4.1 


Because OLS is consistent, we have 6; — 6; in probability as Lı — oo from (4.5.3). Thus, 
for € from (4.5.7) we have êf = Ri — X7ó; > Rj — X16; = ej in probability, as Lı — oo. 
Because abnormal returns e; are independent (across time), the sample abnormal returns 
€; are asymptotically independent as Lı > oo. 


Solution 4.2 


We assume that the cumulative abnormal return test statistics are calculated using the 
known standard deviation of the abnormal returns, that the abnormal returns are inde- 
pendent through time and across observations and normally distributed, and that the 
abnormal returns are measured without parameter sampling error (Li is large). Denote 
Lə = 3 as the length of the event window and N as the number of event observations. 
Designate group 1 as the observations with low standard deviation and group 2 as the 
observations with high standard deviation. For the group means and standard deviations 
we have pı = 0.003, 2 = 0.003, 01 = 0.03, and o2 = 0.06 where the subscript indicates 
the group. Nı = 25 and Na = 25 are the number of observations in groups 1 and 2 
respectively. 

To calculate the power against the given alternative, we need to derive the distribu- 
tions of the test statistics under that alternative. First, we aggregate the abnormal returns 
over the event window for each observation which gives 


L3 
(84.2.1) CAR: 2 d. 
l=1 


Given the assumptions, E[CAR;] = Ləpgu) and Var[CAR;] = L»o7(; where g(i) equals 
the group of observation i. 

Then, we aggregate across observations to form the test statistics (modified to reflect 
the above assumptions). The aggregation of abnormal returns corresponding to Jı in 
(4.4.22) is 


L3(Nioi + Na03) PUAN 
(S4.2.2) Jı = rs De > 


The aggregation corresponding to Ja in (4.4.24) is: 


1 CAR; 
(S4.2.3) Jz = VN N y == 
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Under the specified alternative hypothesis, the distributions of the test statistics Jı 
and J2 are 


v Lə(Nıpı + Napa) 


(34.2.4) Ao Nn D =N No? NEP D> 
«as an ossi u2 
(S4.2.5) Jh ~ a de 


Substituting in the alternative parameter values, for the means of J; and Jo we have 
pi = 0.775 and 3 = 0.919, respectively. 

Consider a two-sided test of size œ based on J; and Ja, respectively, of the null 
hypothesis Ho : [ui po] = [0 0] against the alternative hypothesis Ha : [ui po] = 
[0.003 0.003]. Using equation (4.6.1), the powers of the tests, P; and P» are 


P, = Pri «97 (o/2) + Pr[Ji > 97! (1— a/2)] 
[Sui +87 (0/2))] + [1 — (ui + $7 (1 — /2))], 
Pr[J» < 9^! (a/2)] + Pr[Jo > 9^! (1— a/2)] 

= [8(-p3 +8 (0/2) + [1 — (u$ + 9 1 — o/2)). 
Evaluation of these expressions for a = 0.05 gives P; = 12.196 and P> = 15.1%. 


(S4.2.6) 


P, 


Solution 4.3 


The solution is the same as for Problem 4.2 except that u2 = 0.006 instead of 0.003. Using 
this value for 2, we have pï = 1.162 and u3 = 1.225 giving P, = 21.3% and P» = 23.3%. 


Problems in Chapter 5 


Solution 5.1 


For the regression equation 
(S5.1.1) Ra = Bo + Bi Rop + Bo Fly + €p 


using well-known regression results, we have 


(85.1.2) Br = Cov[Ra, Rop|/ Var[Rop] = Baop, 
(S5.1.3) f» = Cov[Ra, Rp|/ Var[Rp] = Bap, 
(S5.1.4) Bo = Ha — (BaopHop + Bapitp); 


since Cov[R,, Rop] = 0. The result P2 = Pap is immediate, thus we need to show that 
B1 =1- Bap and Bo = 0 to complete the solution. 

Let r be the minimum variance portfolio with expected return equal to that of portfo- 
lio a, pa = Ur. From the form of the solution for the minimum variance portfolio weights 
in (5.2.6), Rr can be expressed as 


(S5.1.5) Ry = (1—A)Rop + AR» 


where \ = (fir — flop) / (p — Hop): Using Cov[Rp, Rop] = 0 and pr = (1 — Apop + Ap we 
have 


Brop = Cov[Rr, Rop]/ Var[Rop] 
= Cov[(1 — A) Ro, + ARp, Rop]]/ Var[Rop] 
(S5.1.6) aa 
Brp = Cov[Rr, Rp]/ Var[Rp] 
= Cov[(1 — A)Rop + AR», Rp] / Var[Rp] 
(S5.1.7) = xX 
(S5.1.8) Hr = Bropftop + brphp. 


Portfolio a can be expressed as portfolio r plus an arbitrage (zero-investment) portfolio 
a* composed of portfolio a minus portfolio r (long a and short r). The return of a* is 


(S5.1.9) Ra = Ra — Ry. 


Since a = Hr, the expected return of a" is zero. Because a* is an arbitrage portfolio 
with an expected return of zero, for any minimum variance portfolio q, the solution to the 
optimization problem 


(S5.1.10) min Var[R; + cRa*] 
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is c = 0. Any other solution would contradict q being minimum variance. Noting that 
Var[R, + cRa*] = Var[R4] + 2cCov[Ry, Rs] + c? Var[Ra+] we have 


[?) 
(85.1.11) Fe Var[Ry + cRa*] = 2Cov[Ry, Ra] + 2c Var[Ra-]. 
Setting this derivative equal to zero and substituting in the solution c = 0 gives 
(85.1.12) Cov[F,, Ra*] = 0. 


Thus the return of a* is uncorrelated with the return of all minimum variance portfolios. 
Using this result we have 


Cov[Ra, Rp] =  Cov[R. + Rax, Rp] 
(S5.1.13) = Cov[R,, Rp] 
Cov[Ra, Rop] = Cov[R, + Ra*, Roy]. 
(5.1.14) = Cov[R,, Rop] 
From (S5.1.13) and (S5.1.14) it follows that 
(S5.1.15) Baop = Prop 
(S5.1.16) Pap = Brp- 


Combining (S5.1.2) with (S5.1.6), (S5.1.7), (85.1.15) and (S5.1.16) we have G1 = Bao» = 
1 — Bap. Since pr = Ha, combining (S5.1.4) with (S5.1.8), (S5.1.15), and (S5.1.16) gives 
Bo = 0 which completes the solution. 


Solution 5.2 


Begin with the excess return market model from (5.3.1) for N assets. Taking unconditional 
expectations of both sides and rearranging gives 


(S5.2.1) a = h — Blm. 


Given that the market portfolio is the tangency portfolio, from (5.2.28) we have the (N x1) 
weight vector of the market portfolio 


1 -1 
Using wm we can calculate the (N x 1) vector of covariances of the N asset returns with 
the market portfolio return, the expected excess return of the market, and the variance of 


the market return, 


1 
IQg-1 
nm PO 
(85.2.4) Um = Wi = JOT 
uou 
— ! — 
(55.2.5) Var[Zm] = wm wm = WO 
Combining (85.2.3) and (S5.2.5) we have 
Cov[Z, Zm] _ OT u 
2% a — A A 0 
(S5.2.6) Bm Vaz.) w'ü ig" 
and combining (85.2.6) and (85.2.4) we have 
(55.2.7) Bum = p. 


From (S5.2.1) and (85.2.7) the result a = 0 is immediate. 
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Solution 5.3 


The solution draws on the statistical analysis of Section 5.3. The calculations for three 
selected stocks are left to the reader. 


Solution 5.4 


Let Z} be a (N+1x1) vector of excess asset returns with mean p* and covariance matrix 
Q*. Designate asset N +1 as the market portfolio m. Assume that Q* is full rank. (If the 
market portfolio is a combination of the N included assets, this assumption can be met 
by eliminating one asset.) 

From (5.2.28) the tangency portfolio q of these N +1 assets has weight vector 


1 x*—]1 x 
(85.4.1) Wy = ra im 
Using straight forward algebra we have 
2 I ,,*\2 
(55.4.2) Ha (Ho) _ EN 


2 NO 
02 wwa 


The covariance matrix Q* can be partitioned in the first N assets and the market portfolio, 


Q Bo}, 
(S5.4.3) or = 
Blom Om 
(S5.4.4) 
| Bom +E om ] 
(S5.4.5) = 


| ga o | 
where Q = G8'o?, + Y is substituted. 
Using the formula for a partitioned inverse (see Morrison (1990) page 69) we have 


kh xa X B 
(85.4.6) Qt = | gn = 4 8'x-'B | 
(S5.4.7) 
Using u” = |u’ um] and (85.4.6) we have 
2 
(85.4.8) pOT p = P + (p Bus) (1 — Bus) 
Substituting œ = p — Bu gives 
2 

(85.4.9) uo — aX a. 

Om 


From ($5.4.2) and (85.4.9) we have 
2 
(S5.4.10) gy e. Em 


which is the result in (5.5.3). 
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Problems in Chapter 6 


Solution 6.1 


Let the number of portfolios in the set be K and let Rx: be the (K x 1) vector of 
time period t returns for the portfolios. Since the entire minimum variance boundary can 
be generated from the K portfolios, for any value of the constant ut, there exists a combi- 
nation of the portfolios with expected return p! which is minimum variance with respect 
to the K portfolios plus the N assets. Choose mi to be any value but the global minimum 
variance portfolio expected return (see equation (5.2.11)) and denote this portfolio op. 
Corresponding to op is a minimum variance portfolio p whose return is uncorrelated with 
the return of op (see Section 5.2). Since p and op are minimum variance portfolios their 
returns are linear combinations of the elements of Rx, 


(S6.1.1) Ra = Rewer 
(86.1.2) Ropt => Robo: 


where wr and wk, are (K x 1) vectors of portfolio weights. Because p and op are uncor- 
related minimum variance portfolios, we have 


(S6.1.3) H = Llop + By (Hp — Hop) 
where 
Cov[R:, R 
8o R, pt] 
P 


1 
= a Re Rice | 


(S6.1.4) = E Rd? . 
p 
(See Section 5.2.) Substituting (86.1.4) into (86.1.3) gives 
(S6.1.5) U = thop + Cov[Ra, Riu We — Ber) 
P 


Analogous to (S6.1.5) for the K portfolios we have 


(S6.1.6) Mx = thop + Cov[Rxe, RA Jof Uto — Hew). 


Tp 


Rearranging (S6.1.6) gives 


(S6.1.7) wi We — Hor) = Cov[Rkt,Rxı) (Mg — thop). 
p 


Substituting (86.1.7) into (86.1.5) gives 
(86.1.8) U = Llop + Cov[Rı, Rx.Cov[Rkt, Ric] (ux — thop). 
Now consider the multivariate regression of N assets on K factor portfolios, 


(86.1.9) R =a+BRx:+e 
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where a is the (N x 1) intercept vector, B is the (N x K) matrix of factor regression 
coefficients, and e; is the time period t residual vector. From regression theory we have 


(S6.1.10) B = Cov[R;,Rx¿]Cov[Rx+, Rk] 
(S6.1.11) a = p-Bpy. 
From (S6.1.8) and (86.1.10), we have 

u =  upop + BULK — thop) 
(86.1.12) Bl Air 


Since (86.1.12) holds for different values of pop it must be the case that (+ — Be) = 0, 
that is the factor regression coefficients for each asset, including asset a, sum to one. If 
(¿— Be) = 0, then (86.1.12) reduces to u = Bug and thus from (86.1.11) we have a = 0, 
that is the regression intercept will be zero for all assets including asset a. 


Solution 6.2 


Let u* and Q* be the mean excess return vector and the covariance matrix respectively 
for the N assets and portfolio p, 


ALA 
(86.2.1) = 
à | Hp | 
Q B'o 
(S6.2.2) o = 
Bo, c, 
(S6.2.3) 
| BB; +E B'o? ] 
(86.2.4) = 
| B; m | 


where Q = Aß'o, + E and X = óó'o, + 107. 

Given the N +1 assets, the maximum squared Sharpe ratio is *'0*7* p^ which is the 
squared Sharpe ratio of the tangency portfolio. As demonstrated in problem 5.4, given 
u — a + Bp, and Q = 8c; + Xi this ratio can be expressed as 


2 
(S6.2.5) sp = pO ps — PP pasa 
Tp 
where s7 is the maximum squared Sharpe ratio for economy I, I = A, B. Analytically 
inverting Y = 90'0; + lo? and simplifying, s? can be expressed as 


2 = £2 A 1 oj(a 5) 
(S6.2.6) Sq] = 8 + 72 (aa + G2 +078)!" 


where So is the squared Sharpe ratio of portfolio p. 


Solution 6.3 


Using (86.2.6) and the cross-sectional distributional properties of the elements of a and 0, 
an approximation for the maximum squared Sharpe measure for each economy can be de- 
rived. For both economies, ya'a converges to 02, and LIO] converges to o2. For economy 
A, qz (a'0)? converges to oa, and for economy B, 4(a’6)” converges to o4. Substituting 
these limits into (86.2.6) gives approximations of the maximum squared Sharpe measures 
squared for each economy. Substitution into (86.2.6) gives 


2 
No, 


S6.3.1 e gl ARS 
( ) P mw o2 + Nojo2 
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2,2 
30 SN ese el, 

(5690) = gm c2 | o2+ Novo? 

Thus we have the squared Sharpe ratios for economies A and B, respectively. The squared 

Sharpe ratios for large N follow from (S6.3.1) and (S6.3.2). For economy A we have 


1 


(S6.3.3) sj = 5, +s, 
Th 
and for economy B we have 
2 
(S6.3.4) sh = Ż + N[]. 


The maximum squared Sharpe measure is bounded as N increases for economy A and 
unbounded for economy B. Examples of economies A and B are discussed in section 
6.6.3. 
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Problems in Chapter 7 


Solution 7.1 


Each period, the corporation repurchases shares worth AX while the total stock is worth 
V = X/(1L-(1+R)"') = (1 + R)X/R. Therefore, the number of shares outstanding 
follows the “law of motion” 


R 
S7.1.1 Niy = | 1 — A—— | N.. 
(87.1.1) id ( = z) ; 
7.1.1 Price per share is P; = V/N;, and dividend per share is D; = (1 — A)X/N¿41. 
(Note that dividends are paid after repurchases, on the remaining shares only). Hence the 
growth rate of dividends per share, G, satisfies 


N; I+R 
57.1.2 1+G = — = —. 
( ) Ni 1+R(-A) 

Dividends per share grow, even though total dividends do not, because the number of 
shares is shrinking over time. 

7.1.2 The dividend-price ratio is 


(87.1.3) DP = D./P; = PENA ( Ne ) l 


Manipulating this equation yields 


R-G 
.1.4 DP = —— 
DOES I+R’ 


which is consistent with equation (7.1.9) after accounting for the fact that prices here are 
cum-dividend, whereas the discussion in the text applies to ex-dividend prices. 

7.1.3 This follows immediately from the results of the previous subsection, because if price 
is the present value of discounted future dividends, including the dividend paid today, then 


(87.1.5) Pay Pe O (4), 


which is what we showed in the previous subsection. 

Intuitively, shares must have the same value to shareholders who sell shares to the 
repurchasing firm and to shareholders who do not. A shareholder who sells a fraction A 
of his shares to the firm each period receives a constant fraction of total dividends and 
repurchase payments, that is, a constant fraction of the firm's cash flow. A shareholder 
who sells no shares to the firm receives a growing fraction of total dividends, because the 
total number of shares is shrinking over time. The value of the shares is the same in either 
case. 


33 


34 PROBLEMS IN CHAPTER 7 


Solution 7.2 


7.2.1 Denoting the discount rate R = e”, we can write 


oo oo 
D; n a 
(57.2.1) F = E;[ 5 = fetter (ettin ?) 
T=t+1 n=1 
oo 
2 
= D: 5 ¿alto? 21) 

n=1 


The condition p +0? /2 < r is necessary for the sum to converge. It follows that the ratio 
of fundamental value to dividend is 


0?/2r 
(S7.2.2) n 
7.2.2 Since 
(S7.2.3) Fi -cD? = e" E[Frji + cD] + Doa] 
and 
(S7.2.4) Fy = e "Es[Figa + Dua] 
we have 
(S7.2.5) De “E Dial 
Since 
(S7.2.6) E,[D?, i] — eErlAdı+1l+ Var Adırı)/2 _ gas e718 


we get a quadratic equation for the parameter A, 

(S7.2.7) No? /2+Ap—r =0. 

For such a parameter A, the price process P; = F; + cD} indeed gives the same expected 
rate of return as the process P, = Fi. 

7.2.3 The Froot-Obstfeld bubble requires a very specific dividend process. However, the 


bubble is strongly correlated with the dividend, capturing the effect of dividend “overre- 
action". The bubble never bursts for a strictly positive dividend stream. 


Solution 7.3 


7.3.1 Using approximate formula (7.1.30), we have for k > 1 


Cov[re, rix] 


p& P&t+k 
E || 21-1 + qaa — ) (zu: T Nat+k — 3] 
( ‘T= pó 1 — pọ 


(S7.3.1) 


pét 
= Elt-ı%C 4-1 — 1 : 
| 1— pó 


Because x: = J p o 9" &i-a, we have 


go? 
(S7.3.2) Eleı-ı214r-1] = Tor 
and 
(57.3.3) El&wı4r-ı] =9* 0%. 
Thus the return autocovariance is 
3.4 EEE ER s 
(87.3.4) Covlre, rex] = $ (s = 128 Cz 


This is negative when ¢ < p. The autocorrelation of stock returns is determined by the 
balance of two opposing effects. Expected stock returns are positively autocorrelated, 
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and this creates positive autocorrelation in realized stock returns. However innovations 
in expected future stock returns are negatively correlated with current unexpected stock 
returns, and this creates negative autocorrelation in realized stock returns. The latter 
effect dominates when ¢ < p. 

7.3.2 Assume now that 


(S7.3.5) Cov[na,t, 6i] = Ong > 0. 

We have 

87.3.6 C LEN. E: 
( ) ov[rt, rci] $ | 02 + 1-9? 1— pọ TE 


If o„,e is large enough, the first term can dominate the others, giving positive return 
autocovariances. 


Solution 7.4 
7.4.1 Equation (7.1.19) implies that for p = vr, 


(87.4.1) Tii S k + ppm + (1— p)dea — pe = k + pp + pers. 


We see that r;44 is just a constant plus a white noise component — the log stock return 
Ti41 is therefore unforecastable. 

7.4.2 Let us rewrite the formula for v; and substitute in the dividend rule. We get 
1-A C nm 


(w—1 — dt-1) Hp —— — — +4, 
pop 


(57.4.2) Ut — di = 


so the log dividend-price ratio d; — v; follows an AR(1) process with persistence coefficient 


(1 —d)/p. 
7.4.3 The log dividend-price ratio is 


(57.4.3) di — p= di — (vi — (dt — vt)) = a + Y)(dı — Ut). 


Because d; — v; is an AR(1) process and the log dividend-price ratio d; — p: is a (positive) 
multiple of d; — vs, it is also an AR(1) process. 
The approximate log stock return can be rewritten, using the formulas for d; and pt, 


as 
Tipi = k- p(pia — dea) + (dea — pt) 
= kpl +y) (vi —dia) to+ (1 A+ y)(di — vi) 4-44 
A 
(57.4.4) = constant + ixl — pi) — ymi+1  p(1 + Ye-ı- 


Setting x+ = (Ay/(1+Yy))(dı — pi), we get a model of the form (7.1.27) and (7.1.28), with 
x: being the optimal forecasting variable for r;+ı up to a constant. 
7.4.4 From the above we have 


(S7.4.5) re+ı — Ex[reqa] = —yneti + pl + vera, 
and 
(S7.4.6) L+1 — Ei [e141] = Ay (= = 23 > 


so that the covariance of the innovations is always negative. 
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Solution 7.5 


Let us denote the expectation conditional on the full information set at time t as E:[-] 
and the expectation conditional on information J; as Ej,[-]. Thus, we have E[E;[-]] = 
E[], Er, [E¢[-]] = Ez,[-], and so forth, by the law of iterated expectations. In particular, 
E[p;] = E[E«[p?]] = Elp?]. Note that the following “prices” are expectations listed in order 
of decreasing conditioning information: pi, pe = E:[p}], f: = Ez, [pi], and Elp?]. 

7.5.1 Calculate 


Var] = Ell(Eelpi] — Ex. [pt] + (Es, [pi] — El] 
(S7.5.1) = ElEılpi] - Es[pt]) ] + El(Es, [pi] — Ep: ] 
> E((Es, [pi] - E[Es, [2:17 
= Var[p:]. 
where the cross term at the second step was eliminated using the fact that 
(S7.5.2) E[E ;, [(Es[p:] — Es. ler) Ei pe] — Elpe])]] = 0 


as Ej, [p?] — Er. [pz] conditional on J; is a constant and Ey, [E«[p;] — Ez, [p?]] = 0. Calcu- 
lations in other parts of the problem are similar. 

Intuitively, a price forecast based on less information is less volatile. 
7.5.2 Calculate 


Varlpí — ô] = Elo — pe) + (p: — 02) ] 
(87.5.3) = El(pt — 901 + Ellpı - $07] 
= Var[p; — pi]. 
It follows that 
(S7.5.4) Var[p; — fi] > Var[p: — ji] 
and 
(S7.5.5) Varp — fi] > Var[p: — Di] 


as was to be shown. 

Intuitively, a forecast based on inferior information has a larger error variance. Also, 
the error variance for a forecast of the actual realization of an uncertain price is larger 
than the error variance for a forecast of a superior-information forecast. 

Stock prices, referred to in Problem 7.5.1, are usually considered nonstationary so 
that their conditional variances do not converge to a finite unconditional variance. On 
the other hand, forecast errors, referred to in Problem 7.5.2 may plausibly be assumed 
stationary. Therefore, the framework of Problem 7.5.2 seems more suitable for econometric 
analysis. 

7.5.3 Note that Pt+ı is defined to be Ey, [pz,1]. Using the approximation 


(87.5.6) Elri41] = ElEs, [rei] = Elf] 
that follows from (7.1.19) we get 
Var[riga] = ERr — Pega) + (Pega — Ef] 
(87.5.7) = E[(rena — Pt4ı)’] + Ea — EG] 
> Var[#ı +1]. 


Intuitively, the variance of a return forecast is less volatile than the return itself. Just 
as in Problem 7.5.2, this result is more useful than that in Problem 7.5.1 because the 
stochastic processes for returns do not seem to have the unit roots characteristic of price 
processes. 


Problems in Chapter 8 


Solution 8.1 
8.1.1 Recall that 837 = O^! (1 — ME[: + R}]), Mí = M + (Ri — E[Ri]) gr. Therefore 
(S8.1.1) EIM/(M)?] 2 M? + 80837 — M^ + (1 — MEL + RIO (0. - = + Ri). 


Note in particular that EIM/(M)?] > M. 

First, we will show that in the market augmented by a risk-free asset with return 
14 Rr, — 1/M, there exists a benchmark portfolio with return 
M}(M 
(58.1.2) 1+ Ra = A, 
E[M? (M)?] 


Consider a portfolio with dollar weights Byr on the risky assets and M - Ele + Fu] Gr 
on the risk free asset. Such a portfolio has payoff Mi (M) and value 
(Br M^ — Ele + Re]! Brr 
(S8.1.3) = M +- ME t+ RIU (0 — MEL + Ri) 
= EM). 

Thus the portfolio return is exactly 1+ Rss, and the proof of existence is complete. — — 

Next, consider any portfolio Rp: such that E[Rp:] = E[Fs;]. The properties of M¥ (M) 
and Ro: imply that 

Var[Ry] = E[((Rpt — Roe) + (Roe — E[R»:]))*] 

E[(Rpt — Ro) ] + El(Ros — E[Fac])] 
Var| Rpt — Rot] + Var[Rot]. 


(S8.1.4) 


The only nontrivial step was to eliminate the cross term 
El(Rp+ — Rot) (Rot — E[Roe))] 
Mi (M) 


(88.1.5) ELM; (M)?] 


E[Rp+ — Rot] — E[Rp: — Roe] E[l + Ros) 


= 0. 


Thus Var[R,ı] > Var[Rot] whenever E[Rp:] = E| Rit], so the benchmark portfolio is on the 
mean-variance frontier. 

8.1.2 First note that E[(M;(M) — Mz (M))(1 + Rp+)] = 0 for any portfolio return Ry. It 
follows that 


(S8.1.6) Cov[M; (M), Rpt] = Cov[M? (M), Ry] < Var[Mz (M)] ? Var[Rpi]'””, 

where equality is attained iff Mí (M) is perfectly correlated with Rp:. In particular, we 
have Cov[M; (M), Ry] = Var[Mz (M)]'? Var[Ro:]'/? and therefore 

Var[M? M)? 


(S8.1.7) Corr[M:(M), Roe] < Var[Mi(M)]!/? 


= = Corr[Mi( M M), Roi), 
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so that Rə: is a maximum correlation portfolio among all Rpt’s with respect to any sto- 
chastic discount factor M¿(M). 
8.1.3 From E[M¿(M) (+ Ris)] = 1 we get 


(S8.1.8) Cov[M; (M), R4] = 1 — EIM/ (M)JE[1 + Rit] 
and similarly for Ry. Therefore 


Cov[M? (M), Ru] 1—E[M¿(M)E[L + Ri] 


(S8.1.9) Cov[Mrz(M),Ru] 1-EIM-(MJEIL+ Ru] 


Note that E[Mz (M)] = M and M7 (M) = c(1 + Ry), where c = E[M? (MY]| ! >0 isa 
constant. Thus, the above expression simplifies to 


Cov[Rs:, Rit] = 1/M —E[1 + Ri] 


88.1.10 = YE 
( ) Cov[Rs; , Fi] 1/M — E[ + Ri] 


which yields (8.1.17). 


8.1.4 Indeed, 
m Mt (M) M 
(S8.1.11) Bil + Ry) E || = EDuz Ty 
and 
(m) = [xm IA Rw) MD » 
oO bt = — 
E[Mj (M)?] (E[Mr (M)?]) 
(ein: any - wh) 
(88.1.12) EOM NE P PM MUN SN 
E[Mz (M)?] 
so that 
Ru) | (EMAD ^ 
(88.1.13) pu M (SEE COT 1) 
Similarly, 
— EIM} M)’ _ 7 
(S8.1.14) 1/M =E[L + Re] = Delo ERE D 
c (BIN; a - 7”) 


ES (yl ») ul 


el 


8.1.5 Note that 


(S8.1.15) E[M,(M)] = M = E[Mt (M)] 
and 

EIM.(M)’] = EI(M/(M) + a a ] 
(S8.1.16) = E[Mi(M)] + E[(M.(M) - Mz (D) ] 


)] 
> E[M; (M1). 
Therefore o (M¿(M)) > o (M;(M)) and finally 


o(Ry) | 6 (Mi ( 


(S8.1.17) Ell + Ro]  E[M;( 
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Solution 8.2 


8.2.1 Assume a representative-agent utility function as in (8.2.2), 


oo a 07 oo . 
(S8.2.1) u((C) = s => =), wc), 
jo ^ 7 j= 
and consider the maximization problem 
(S8.2.2) (hate E«[u ($C )] 


subject to Wii + Ca = wit + Rı+ı)Wı and W; >0. 

Consider a single intertemporal sub-problem involving incremental investment of an 
amount x in a specific asset 7 from period t to t+1, at the cost of time-t consumption, 
the proceeds of the investment to be consumed at t +1: 


(S8.2.3) maxE; [v(C; = x) + óv(Ci+1 + x(1 + Ri1+41))]. 
At the optimum of the previous problem, x = 0 has to be optimal here, so that 
[o] 
(88.2.4) E [v(C; = x) + bu(Cr41 + x(1 + Rit41))] = 0, 
x=0 


which implies 
(S8.2.5) E; [C;? + ó(1 T Rit+1)C 7] =0, 
from which (8.2.3) follows. Note that (8.2.3) is not only necessary but also sufficient for 


the optimum once it holds for all ?s and t's. 
Assuming that asset returns and consumption are jointly log-normal, the quantity 


(88.2.6) (1 + Rijt41)6(Cr41/Cr) 7 
is also log-normal and therefore by taking logs of (8.2.3) 
(88.2.7) 


Ey [log (1 + Ri,t41)5(Cr41/C1)~7)] + > Varılog (1 + Ri,e41)6(Cr41/Cr) 7)] = 0, 
so that 
(S8.2.8) E[r; t+] + log 6 — yE¿[Aci+1] + 
5 (Vare[ri; +1] + y? Vari [Aci] — 2yCovelri iia Ac++1]) =0 


which gives (8.2.5). 
Assuming that conditional variances and covariances 


Vi; = Var[rit+], 
(S8.2.9) Voc = Var¿[Ac¿+1], 
Vie = Covelrii41, Act+1] 


are all constants, we can write (8.2.5) as 
1 
(S8.2.10) Ei [ri t+] = Ei [Aci] + (- log ô = 3 (Vii + Y Va. = 2) , 


which is a linear function of Eı[Acı+ı] with slope coefficient y—the coefficient of risk 
aversion for the power utility function. This solves part (i). Subtracting (8.2.6), the 
riskfree asset equation, we get as in (8.2.7) 


1 
(88.2.11) Eiri t+ — rfi] + 5 Vii = Wie, 


so that the “premium” of the asset is proportional to the conditional covariance of the log 
asset return with consumption growth, with coefficient of proportionality y. This solves 
part (ii). 
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8.2.2 Part (i). Let aggregate equity e pay a log dividend equal to log aggregate con- 
sumption, so that Ade, = Acı. From the previous part we know that E;[re.:+1+;] = 
YE:[Acı+ı+;], up to a constant. Then (7.1.25) implies that 


(S8.2.12) 


oo 
Te,t414j — Etfe tit = Acip — EtAc + (1 — y) p [Et Acip145 7 EsAci4143) - 


j=1 
By simple algebraic manipulation of the process for Acı+ı, we obtain the following 
expression for Acris: 


(88.2.13) Ace; =H (E 4) + git Aci + > 9 utip, 
i=0 i=0 
so by the Law of Iterated Expectations we have that 
(S8.2.14) E¿Aci4145 =p > 4) + pt Ac 
i=0 
and 
(S8.2.15) Bey 1 Aci4145 =p (E «) + di Ac + dui, 
i=0 


which in turn implies 
(S8.2.16) Ein ACity = Bir Act+14; = di uia. 


Substituting in this expression, and noting that Acızı — Ez: Aci+1 = ut+1, we obtain 


oo 
Teteitj —Exrettity = w (1-7) Y P u 
j=l 
(S8.2.17) = Urt a = y) p? Ut41 
1— pó 


1 — yp% 
(E)n 


Part (ii). For a real consol paying a fixed real dividend we have that Ad; ¿4145 = 0, 
so the unexpected return is influenced only by changes in expected future interest rates. 
Similar reasoning as in part (i) gives the unexpected real consol bond return as 


—Ypó 
$8.2.18 ;— E — ; 
( ) Tb, t+1+j tTb,t4-14-j (i = 2 Ut+1 
8.2.3 Part (i). From equation (8.2.7), the equity premium is given by yVce, where 
Ve = Cove (Acti — Er Acti, Te t41 — Etre,t41) 
ja 
(S8.2.19) = Cov c kun) 
1-— pó 
_ 1-?p$» 
== 1 Em pd Ou, 


and we may write Cov(-,-) instead of Cov;(-,-) because the process for Acı+ı is ho- 
moskedastic. 
Similarly, the consol bond premium is YV.», where 


(S8.2.20) Y N 
1— pó 
Part (ii). The bond premium has the opposite sign to ¢ because a positive ¢ implies 
that a positive endowment shock increases future consumption more than current con- 
sumption, so real interest rates rise and bond prices fall when consumption rises. Real 
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bonds thus provide a hedge against endowment risk and they have a negative premium. 
The bond premium is proportional to the square of y because a larger y both increases 
the variability of real interest rates and bond returns, and increases the premium required 
by investors for bearing a unit of risk. 

Part (iii). The premium of equity over the consol is 


You 
(8.2.21) ey (Vee — Vie) aa) 
so the equity premium is just the bond premium plus a premium related to dividend 
uncertainty, which is always positive and proportional to y. 
Part (iv). The lesson for the equity premium literature is that models with high 
degrees of risk aversion tend to imply a high bond premium as well as a high equity 
premium. This is a counterfactual implication. 


Solution 8.3 


8.3.1 The second-period endowment is m with probability 3 and (1—a)m with probability 
3. This can be written as 


ula 


(S8.3.1) sel 
(1— a)m, 


Ne 


where w is both the individual and aggregate second period endowment. 
Consider buying e of the asset. The asset price is paid in the second period, so the 
expected utility cost is 


(S8.3.2) ;U' (m) pe + su’ (1 —a)m)pe = pe E 4 on 


The expected utility gain is 


1 1 1 
(S8.3.3) zu (m) me + SU (1 — a) m) (1 — a) me = 3€ [1 +1] 
(S8.3.4) =€. 
In equilibrium, the expected utility cost must equal the expected utility benefit, so 
1 2—a 

58.3.5 pe | — | = 

(88.3.5) ae | =« 

which implies 

2(1-a) 


The expected gross return on the claim is the ratio between its expected payoff and 
its price: 
gm+z(l-a)m _ 22 
p 4-8)" 
which rises with a (a measure of aggregate risk) as we would expect. 
8.3.2 Now we have that the individual second period endowment is: 


(S8.3.7) 1+R® = 


(S8.3.8) w= m, 
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while the aggregate endowment is still as before: 


tI 


m, 
(S8.3.9) w^- 


(1— a)m, 3 


Note that we must have b > a so the individual endowment is always non-negative and 
log utility is defined. If b = 1 then we are back to the previous case. 

Since all agents have the same utility function and face the same probability of being 
in each group, they all have the same expected endowment and are identical ex-ante. 
However, ex-post their endowments will differ,so there will be ex-post heterogeneity. 

As before, consider buying e of the asset. The expected utility cost is 


1,, 1 i 1. a 
(S8.3.10) gU' (m) pe + z (1 — b) U' (m) pe + 58U ((1 - 3 m) pe 
_ lpe n b _ lpe a 
on + a Dtr an Pn] 
The expected utility gain is 
1 1 1 $ 1 7) a 
(S8.3.11) 3U (m) me + 5 (1-b)U (m) (1-a)me+ ¿PU (a — =) m) (1 — a) me 
= i. |i«a-9a-94 E. 
2 1— a 
In equilibrium, expected cost equals expected gain so 
1 pe a 1 b(1—a) 
.3.12 == |2 + — | = že |1 1— 1— _— |. 
(S8.3.12) a “| xh b)(1-a)+ == 
Thus 
_ [2(6—a)+a” (1 — b) 
(88.3.13) p= | AT m, 


which gives the previous result when b — 1, and p= (1 — a) m when b= a. 
The expected gross return on the claim is : 


3m+3(1-a)m 

p 
(2 — a) [2 (b — a) + ba] 
4 (b — a) + 2a? (1 — b)’ 


so when b — 1 we obtain the same result as before, 


(S8.3.14) 1+R® 


(S8.3.15) 


2 — a) 
3.1 14 RO = | =1+ R(9 
(S8.3.16) +R eog E 
and when b= a, 
2 — a) 2 
FR ugt 29r eee 1+ RO». 
(S8.3.17) +R 2 —a) ENS +R) 


Since 0 < a < 1, R® > RV. Therefore, heterogeneity in the form of individual 
uninsurable risk increases the expected return on the asset. 

8.3.3 The literature on representative agent models tends to find that average stock re- 
turns are higher than can be explained with plausible degrees of risk aversion. Uninsurable 
individual risk might be one explanation. 

This problem is based on N. Gregory Mankiw’s “The Equity Premium and the 
Concentration of Aggregate Shocks”, Journal of Financial Economics, September 1986. 
Mankiw shows that one gets similar results for any utility function with U” >0. Qua- 
dratic utility has U”” = 0 (“certainty equivalence") and uninsurable individual risk has 
no effect. 
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The result also depends on the fact that there is more dispersion of individual endow- 
ments in bad times than in good times. 
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Problems in Chapter 9 


Solution 9.1 


Without loss of generality, let us consider the random variable p, (T) (the derivation for 
pn(t) is analogous). Denote the moment-generating functions of the increments ez in 
(9.1.1) and pn (T) in (9.1.2) as M.(r) and M,(r) respectively, where 


(S9.1.1) M.(r) = Ele™®] 

(S9.1.2) = pe sage 

(S9.1.3) M,(r) = Eje”? 0] = Ele” Fr=ı eh] 

(S9.1.4) = E[] [ e^**] = [| Ele] 
k=1 k=1 

(S9.1.5) = | re^ + a- mea | 


Recall from (9.1.7) that m = 4 (1 + T and A = eh. This implies 
ii 
M,(r) = EZ 5 IE ge + 
1 = n 
(S9.1.6) (dee: hy, TEME | 
2 o 
h n 
(S9.1.7) = | cosh(roVh) + BR sinh(to Vh) | 
where cosh(x) and sinh(x) are the hyperbolic sine and cosine functions 
e+e” © a 
cosh(z) = — ie a ps 
inh cc. HOO Ue zx? q 
sinh(x) = RU > Epp 


Simplifying and letting n — oo yields 


(S9.1.8) Myr) = [d 


gene 
(S9.1.9) a, ee 


which is the moment-generating function for a normal random variable with mean pT and 
variance o°T. 


Solution 9.2 


Denote by 0 =| c? |' and observe that 


ENT 107£(0 | [| —> 0 
(S9.2.1) Z(0) = lim — E E 9806 | - | 0 ES 
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Therefore, the inverse is simply 


(59.2.2) T'(6) = | 1 0 | . 


Solution 9.3 


Consider n observations in the interval [0, T] equally spaced at intervals h = T/n, and let 
p(0) = 0 to simplify the algebra. Let p; = p(kh). Using (9.3.48) we find that 


kh 
(S9.3.1) pr — e7 pp-1 = uh(k — e ?^(k —1)) + of ei gs). 
kh-h 
Now let p = e^?^. Then we can rewrite (89.3.1) as 
(89.3.2) Pr — ppr-ı — ph(k — p(k — 1)) = ocer , 
where e; ~ N (0, 1) and 
kh = 
(S9.3.3) o? = Var le f g UP CIE = = (1 guy. 
kh—h Y 


We now derive the maximum likelihood estimators ji, ô and 62 from which we can obtain 
A ^2 
^ and 0” as: 


22 246: 


ent ol ^ "EMI e 
(S9.3.4) y—-yle(, o = (1.23) 


by the Principle of Invariance (see Zehna [1966]). The log-likelihood function is given by 


2 n 2 1 = 2 
(89.3.5) £(u p.07) = —5 108 (210?) — 5D [pr — ppr-1 — uhlk — plk —1))] 


€ k=1 


and the necessary first-order conditions for the maximum of the log-likelihood function 
are 


OL 1x 

da 7 GE ba pea — wh(k — p(k — 1) — p(k — 1)) — 0, 
€ k=1 

OL 1x 

I v [pr — ppr-ı — ph(k — p(k — 1)))(pe-1 — uh(k — 1)) = 0, 
€ k—1 

OL n 1 Dx 

om = esc ppr—1 — ph(k — p(k —1))]? = 0. 

€ € € k—1 


These conditions can be written as a system of equations in (ji, 6,62): 


(89.3.6) po = Deter Pe = Apna) (k= 66-1) 


ga li — Alk 1)? 
(S9.3.7) ô = IT Inn AME DP 


(89.3.8) à? = LY p. ppr- — AM — Alk Df. 
k=1 
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From here we will assume that the trend y is known exactly. Then we can calculate 


OL 1x 
zz 7 TZ Y lpr- — ph(k — DP, 
p E koI 
DR. iok 
Op0c2 c2 Op’ 
ÖL ni 1i% 
Mod? 7 Dad 5$ 22 — ppe- — uhlk — plk DI. 
€ E € k=1 
But observe that 
ir. _ limno y ar (Pr-ı = ph(k — 1))* 
noo n Op? (5,62) limno 62 
m 1 
~ 1-e-?' 
2 
lim —E 1 IL = 0, 
n>00 n OpOc? |(5.52; 
; 1 @L ; 1, n1 1 
een M ao ES] ur 
Using (9.3.7), we conclude that 
(89.3.9) val =p) & N(0,1-e2), 
(S9.3.10) vVn(8?—e2) & N(0,207). 


The asymptotic distribution of y and a? can now be obtained using (S9.3.4) and the delta 
method described in the Appendix A.4: 


(593.11) Vn- £ N (o, ale" - 1) ; 
(89.3.12) y/n(6à) c) E N (0 20° (1 + pl cr pee 220) 


To derive the continuous-record asymptotics of y and ó2, we let n — oo while T is 
held fixed, hence h — T'/n — 0. Since 


|. 1 


i ` 1 2 
(S9.3.13) 7 = —ylog(1 - (1- 9) = (0-6) + olh) = 7 + (7), 
we conclude that 
ui ces — yt — ut 

(S9.3.14) y Z nom (p. LES Pra - Hn ) , 

z alo 1 — u] 
The denominator converges to 

T 

(89.3.15) f GE pi ds, 

0 
while the numerator converges to 

T T 
($9.3.16) f 00-994 (9) us) ds 
0 0 


We conclude that 
fo (p(s) — us) dp(s) — n fa (p(s) — us) ds 


89.3.17 y E — 
cas : JE (rs) = ns)? ds 
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which simplifies to 

Jo as) da(s) 

a. 
So q(s)? ds 

where q(t) = p(t) — pt. Finally, it can be shown that 


(89.3.18) 4g — 


2 _ 18 ^ a 
(89.3.19) ô? = = > gr — pari)’ To”. 
k=1 


Solution 9.4 


9.4.1 The maximum likelihood estimates (9.3.27), (9.3.28) are evaluated using daily re- 

turns from January 2, 1991 to December 29, 1995 and assuming h = 1/253, i.e., 253 

trading periods in a year. The riskfree interest rate is set to r = 5%. The estimates are: 
^2 


ô? = 0.074 , =r- Z = 0.0131. 


9.4.2 Two variants of the Monte Carlo method are used: 


1. The crude method of Section 9.4.1. 
2. The antithetic variates method of Section 9.4.4. 


The initial stock price (the closing price on December 29, 1995) is $91.375. 100,000 
replications are used in both cases (m = 100,000). The number of discrete intervals is 
n = 253. 

The crude Monte Carlo method produces an estimate according to (9.4.6) of: 


A (0) = $17.70 . 
The standard deviation of the estimate Ĥ (0) is estimated according to (9.4.10) as: 
ô, (253) = $19.60 . 
Therefore, according to (9.4.8), a 95% confidence interval is 
$17.58 < H(0) < $17.82 . 


The minimum number of replications necessary to yield a price estimate within $0.05 of 
the true price is estimated according to (9.4.9): 


m > 5.905 x 10° . 
The antithetic variates method produces an estimate according to (9.4.13): 
H(0) = $17.63 . 
Standard deviation of the estimate H(0) is estimated according to (9.4.15): 
6,(253) = $8.56 . 
As a result, according to (9.4.8), a 9596 confidence interval is 
$17.58 < H(0) < $17.69 . 


The minimum number of replications necessary to yield a price estimate within $0.05 of 
the true price is estimated according to (9.4.9) as: 


m > 1.126 x 10°. 


9.4.3 The closed-form solution for the option price is given by the Goldman-Sosin-Gatto 
formula (9.4.11) and is evaluated using the estimate of a? obtained in Problem 9.4.1: 


H(0) — $18.91 . 


The difference between the theoretical price H(0) and our estimate H(0) arises from 
the difference between the maximum of discretely-sampled and continuously-sampled 
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prices. Specifically, the theoretical price H(0) of the option is evaluated under the as- 
sumption that the option allows one to sell the stock at the maximum price observed over 
the course of the entire year. The estimate H (0) was obtained under the assumption that 
only daily closing prices are used to evaluate the maximum. Obviously, the first definition 
always leads to a higher option price than the second. 

In the context of this particular problem the second definition of the option (the one 
used in Monte Carlo simulations) is more relevant, since it is based on the definition of 
the actual option. The Goldman-Sosin-Gatto formula is a continuous-time approximation 
to this option. Therefore, the Monte Carlo estimator of the option price should be used 
to decide whether to accept or reject CLM's proposal. 
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Problems in Chapter 10 


Solution 10.1 


10.1.1 Prices of the zero-coupon bonds are P4 = e~8*°! = 0.4829 and Pg = e 9x0 y 
0.4868 per dollar of their face values. Since nominal interest rates cannot be negative, the 
finding that Pa < Pg implies an arbitrage opportunity and is inconsistent with any ex- 
pectations theory. 

10.1.2 Prices of zero-coupon bonds are now P4 = e^ /*999! = 0.5289 and Ph =e 
0.5273 per dollar of their face values. As P > Pj in this case, the prices do not imply 
an arbitrage opportunity and may be consistent with the pure expectations hypothesis. 
10.1.3 Let us assume the coupon payments are annual and are made at the end of the 
year. Consider first the case analogous to Problem 10.1.1. Prices do not now imply an 
arbitrage opportunity. As an example, assume that all one- to eight-year zero-coupon 
bonds have price Pg per one dollar of their face value, and that the nine-year zero-coupon 
bond has price Pg. Under these assumptions we can express the prices as 


—8x0.08 | 
N 


Pa 
1. Py = ——*— x 0.2944 
erty) 3 148x008 04945 
Pg — 8 x 0.08 Ps 
1. Po = So 20.2752. 
(S10.1.2) b wen: 0.275 


We see that, under this non-stochastic term structure given by Pg and Po, all interest 
rates are nonnegative and Pg > Po, so that no arbitrage opportunity exists. 

Now, consider the case analogous to Problem 10.1.2. Assume that all one- to seven- 
year zero-coupon bonds have price P; per one dollar of their face value and that eight-year 
zero-coupon bond has price P%. Under these assumptions we can express the prices as 


P! 
10.1. P} = — x 0.3390 
(S10.1.3) T 1+7 x 0.08 j 
Ph —7 x 0.08P! 
14 PE, SB ee 0.3195, 
(S10.1.4) A 2008 0.3125 


P; > Pz, so again there is no arbitrage opportunity. 

Note however that the assumptions required to rationalize these bond prices are rather 
extreme, since they require zero nominal interest rates between one and eight years. The 
loglinear approximate model for coupon bonds presented in (10.1.20) gives a different 
answer. This model effectively imposes “smoothness” on the term structure. Equation 
(10.1.20) allows us to compute the implicit n-period-ahead 1-period log forward rate given 
the coupon-bond duration Dent in (10.1.10), which in turn requires the coupon-bond price 
Pent in (10.1.9). 

For the data in Problem 10.1.1 we have 


(S10.1.5) Post .9171; Deot = 6.1186 years 
Pest = .9797; Degi = 6.7212 years, 


so (10.1.20) gives 
fe: = —3.168496 < 0 
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Notice that the bonds are not selling at par, so it is not correct to use the simpler formula 
for Dent that obtains in this case. This result again implies under the log pure expectations 
hypothesis a negative one-period log yield 8 periods ahead. 
Similarly, for the data in Problem 10.1.2 we have 
(S10.1.6) Pen = .9245; Der = 5.5615 years 
Pest = .9813; Dest = 6.1876 years, 


so (10.1.20) gives 
fu —1.7711% < 0 


which implies under the log pure expectations hypothesis a negative one-period log yield 
7 periods ahead. Using this approach, we find that coupon bonds violate the log PEH 
more than zero-coupon bonds. The reason is that the duration of coupon bonds does 
not increase linearly with their maturity, but increases at a decreasing rate. That is, 
Dem+1,t— De,n,t < 1. This in turn makes it easier to get negative forward rates for given 
yields. 


Solution 10.2 


10.2.1 Assume the postulated process and simplify notation, introducing a; = yu — yi,t-1 
and b, = y»: — yit- The equations of the model can then be written as 


I at = Abi +e, 
1 
(S10.2.1) II b, = 5 Edacci] + Lt; 
III tt = d@t-1+M, 
IV at = YL + €t. 


¿From the first and fourth equations we get b, = yA~'a; from the third and fourth 
equations we get E,la¿+1] = yore; the second equation then gives an expression for the 
coefficient y in terms of the other parameters of the model, 


(810.2.2) ya € Au) 


It is straightforward to verify that with this value for y, the yi; process satisfies all the 
equations of the model, provided that Ad « 2. 
10.2.2 Using notation from Problem 10.2.1, the regression has the form 


(810.2.3) at+1/2 = a4 bi + uia. 


As E¿[ar+1] = yor: and b = yA” xi, we see that the population parameters are a = 0 
and 3 = Ad/2. Clearly 8 < 1 since we have required Ag < 2. 

10.2.3 Assume the process of the given form and simplify notation, introducing a, = 
yit — yit-1 and b S Ynt — Yit. Note that 


(810.2.4) Jn,t41 — Ynt = bii + 0441 — be. 


The equations of the model and of the postulated process are then 
I at = Abi + ei, 
(810.2.5) II bi = (n = DE¿[bi+1 + a141 — bi] + tt, 
II Lt = QLi-1 + Ne, 
IV at = YL + €t. 


¿From the first and fourth equations we get b; = yA” ! zi; from the third we get E:[b:+1] = 
yA7* ózi; from the third and fourth we get Ei[a441] = yoz; and the second equation then 
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gives the condition for the parameter y: 
A 
n — (n —1)9(1-- A): 
It is straightforward to verify that with this value of y, the yiz process satisfies all the 
equations of the model, provided that (1+ A)¢(n — 1) < n. 
In our notation, the regression takes the form 


(S10.2.6) y= 


(S10.2.7) bisa + Qt41 — bb =at pe + um. 

As Elbi41+a1+1—bi] = (yA '@+yb—YA7| )gi and bj = yA7 | zi, we see that the population 
parameters are a = 0 and 8 = (1 + A)¢(n — 1) — (n — 1). The parameter restrictions we 
have imposed allow 8 to be either positive or negative. 

10.2.4 The model does explain why short-rate regressions of the type explored in Problem 
10.2.2 give coefficients positive but less than one, while long-rate regressions of the type 
explored in Problem 10.2.3 often give negative coefficients. The underlying mechanism is 
a time-varying term premium, interacting with the desire of the monetary authority to 
smooth interest rates. 

A limitation of this model is that it assumes a nonstationary interest rate process, 
which has unsatisfactory long-run properties. For example, with probability one the in- 
terest rate eventually becomes negative. Bennett McCallum, “Monetary Policy and the 
Term Structure of Interest Rates”, NBER Working Paper No. 4938, 1994, works out a 
stationary version of this model; the algebra is more complicated but the properties of the 
model are similar. 
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Problems in Chapter 11 


Solution 11.1 


11.1.1 We assume throughout the problem that bond prices are determined by the ho- 
moskedastic lognormal model implied by equations (11.1.5) and (11.1.3), 


(S11.1.1) —mui = et Pb 
(S11.1.2) Sui = (1—-4)ptÓmit Ema, 


with & ~ N(0,0?), but to fit the current term structure of interest rates we assume instead 
that the state variable follows the process given in equation (11.3.4): 


(S11.1.3) Lei = Digit + gen + Etti- 


A useful way to relate the deterministic drift terms g:+; and the parameters of the 
true pricing model when fitting the term structure of interest rates is to compute the 
forward rates implied by the assumed model (S11.1.1) and (S11.1.3), and compare them 
with those implied by the true model (S11.1.1) and (S11.1.2). To compute the forward 
rates implied by the assumed model we need first to compute the log bond prices, since 
Inst = pns — Pn+1,+- Using equality (11.0.2) and the lognormal property of the stochastic 
discount factor, we have that 


n 
II Miri 


i=l 
n 1 n 
2. Mt+i + 3 Var: (E ms) 2 


But from the assumed model for the state variable en 1.3) we have 


Pn,t log E; 


(S11.1.4) epi = mu “Eso + Esos 


so 


n n n 
) mi = = ) &H-1— Ê ) btti 
¿=1 ¿=1 ¿=1 

n n 


nzi— y (n—i)ge — 9 (8 n - 0? Er 


i=1 i=1 


and 
Z 1 
(S11.1.5) Prat m a e= giri + par B+n-i)o 
We can now use (S11.1.5) to compute forward rates implied by the assumed model: 
Int = Pat Prit 
n 
(S11.1.6) = wet ger — > (94m)? o”. 
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Comparing (S11.1.6) with equation (11.1.14), that gives us the forward rates implied by 
the true model, we find immediately that the drift terms g+; are related to the parameters 
of the true model by the following expression: 
ilm 
(+ E) - eer 


$ 


11.1.2 Since rı 441 = -Eı [mi +1] — Varı (mi) /2, the short term interest rates at (t+ 1) 
implied by the assumed model and the true model are the same: 


2 
oOo. 


(S11.1.7) D oi =- (1— 0”) (z: — p) -5 


1 
(S11.1.8) T1,t4+1 = Té — 387. 


The dynamics of the state variable in the true model, given by (S11.1.2), and (S11.1.8) 
imply that future short rates equal: 


n 
i 1 
Titten = p(l- o”) +9 "20 + 2, pour PLA 
= 


= ru — (1— 6") (te — i) + 2 96s 


so the expected future log short rates in the true model are 


(S11.1.9) Es [ritn] = rie — (1 — 0") (ae — p). 
The dynamics of the state variable in the assumed model, given by (S11.1.3), imply: 
n n 1 
Tijépn4l = Ur at F be 23 fg? 
1= Tux 


n n 
= Ti+ + 5 Jt+i + 5 Eiti, 
i=1 i=l 


so expected future log short rates under the assumed model are 


n 
(S11.1.10) Es [ri t4n41] = 11,41 + ba 


i=1 


Therefore, if we choose the drift terms so 
(S11.1.11) XO gi =- (1 — 6") (zi— n), 
i=1 


the assumed model will be able to reproduce the expected short rates. However, by 
comparing (S11.1.7) and (S11.1.11) we can see that it is not possible to choose drift terms 
so they match simultaneously both current forward rates and expected future log short 
rates, since 


1-9" 2 
(+ =E) 40m 
unless ¢ > 1, i.e., unless the state variable in the true model follows a random walk. It is 
also interesting to note that the set of deterministic drifts that matches expected future 
log short rates—see equation (S11.1.11)—converges to — (x: — u) as n — co, while the set 
of deterministic drifts that matches forward rates—see equation (S11.1.7)— tends to —oo 
as n — oo. Therefore, if we choose the drift terms so they reproduce the forward rate 
structure of the true model, this will result in expected future log short rates declining 
without bound as we increase the horizon, while the true model implies that the expected 
future log short rates converge to a finite constant. 
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11.1.3 ¿From equation (11.1.8) and (S11.1.2), the time t conditional variance of log bond 
prices at time t + 1 implied by the true bond pricing model is 


Varı (pntti) = Bp-1Er [vezi — Erze] 
A » 
(S11.1.12) - (5$) o’, 


while from (S11.1.5) and (S11.1.3), the time t conditional variance of log bond prices at 
time t+ 1 implied by the assumed bond pricing model is 


(S11.1.13) Vari (Pn t41) = no”. 


Hence (S11.1.13) cannot be equal to (S11.1.12) unless ¢ > 1, i.e. unless the state variable 
follows a random walk in the true model. Moreover, for n > 1, the conditional variance 
of log bond prices implied by the assumed model is larger than the conditional variance 
implied by the true model and, while the true model implies that the conditional variance 
of log bond prices is bounded at o?/(1 — $)? as n — oo, the assumed model implies an 
unbounded conditional variance. 

11.1.4 Section 11.3.3 shows that the price of a European call option written on a zero- 
coupon that matures n +7 periods from now, with n periods to expiration and strike price 
X, is given under the true model by 


Cnt (X) = PaL o (di) +X Pn,t P (d2), 


where Pa, = exp{pn,t} = exp{An + Bng} is the price of the bond, ®(o) denotes the 
cumulative distribution function of a standard normal random variable, 


Dntrt — £ — Dnt + Vart (Pr,t+n) /2 


LS, 
Vari (Pr,t+n) 
da = di — X Vari (prin), 


x = log(X) and 


Var, (Pr,t+n) B? Var; (tt4n) 
teg N /1-¢"\ 5 
11.1.14 = 
um ld, 
In our assumed model we use the same formula to value the option, except that we 


need to compute Var: (Pr,t+n) under our assumed process for the state variable (S11.1.3). 
From (S11.1.5), we have 


Var, (Pr,t+n) = n? Var; (zin) 
(S11.1.15) = Tno’, 


where the second line follows from (S11.1.4). 

Obviously, (S11.1.15) differs from (S11.1.14), unless ¢ — 1, so in general the assumed 
model will misprice options. For r > 1 and/or n > 1, it will overstate the volatility of 
the future log bond price, hence overvaluing the option. This overvaluation increases with 
the expiration date of the option and/or the maturity of the underlying bond. This is 
true no matter what combination of the drift parameters we choose. Backus, Foresi and 
Zin (1996) use this result to caution against the popular practice among practitioners of 
augmenting standard arbitrage-free bond pricing models with time-dependent parameters 
to fit exactly the yield curve. This augmentation may seriously misprice state-contingent 
claims, even though it is able to exactly reproduce the prices of some derivative securities. 
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Solution 11.2 


11.2.1 The homoskedastic single-factor term-structure model of Section 11.1.1 holds: 


(S11.2.1) Smii = Tt + Cua 
(S11.2.2) L+1 = (1 = Pu + or + Et41. 
Thus, the price function for an n-period bond is 
(S11.2.3) —Pnt = An + Bni 
with 
1-49" 
(S11.2.4) Ba = 1+ @Bn-1 = ; 
1-9 
(S11.2.5) An — An-ı = (1 — $)uBa-ı — (8+ Bai) 0? /2, 


and Ao = Bo = 0. 

Equation (11.3.15) in CLM gives the price at time t of an n-period forward contract 
on a zero coupon-bond which matures at time t+n +7 as Grnt = Prin,t/Pnt. Taking 
logs, grent = Pr+n,t — Pnt- Substituting out prin,e and pa: using (S11.2.3)-(S11.2.5) yields: 
(S11.2.6) —Irnt = (An+r == An) + (Batr za Bn) xt. 

Thus, the pricing function for an n-period forward contract on a zero coupon-bond 
which matures at time t + n +7 is given by: 

(S11.2.7) —Irnt = A24 + B? „xt, 
with 
Añn = An+r — An 
Bin = Bnr — Bn, 
where (S11.2.4) and (S11.2.5) can be used to write An+r, Bn+7 as functions of An, Bn. 
Clearly, the log forward price grn: is affine in the state variable z+. 

In order to show that the log futures price h-nt is also affine in the state variable we 
can use equation (11.3.10) in CLM: 

(S11.2.8) Henri = Er [Mirar Hy n-1,t41/Prt] 


Taking logs and assuming joint lognormality: 


1 
(S11.2.9) Rene = Es [Mia + hrn- 1,41 — Pre] + 3 Var: [mii + hrm-1,t41 — Pit]. 


Let us first determine h,1+. Since hz 0,41 = Pr,++1wWe have that: 


1 
(S11.2.10) hrit Ema + pesi — pit] + 3 Vari [Mi41 + peti — pit] 
Substituting out mı+ı using (811.2.1) and p, ¿+1, pı: using (S11.2.3) and (S11.2.2) yields: 


he = ef xt — PEryr — Ar — B- (1 — ġ)u 


(S11.2.11) Bot: — B-&pı +: — gen + 


Z vanf Le — bry — Ar — B- (1 — ġ)p 


B, out ET B.fua + Xt— go] . 


Since Etét+1 = 0 and Var; £141 = ce? it follows that: 
(S11.2.12) —hr = AT, + Bh 
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with 
Af, = Ar + (1 — &)uB, — o? [2 [-8° + (6+ B,)”] 


Bi =0B,. 


Let us now solve for hrni. We guess that —hrnt = Ar, + B*, x4. We proceed to verify 
our guess. At the same time we derive formulas for the coefficients A^,, B^, as functions 
of the term structure coefficients An, Bn. Proceeding as above: 


1 
(S11.2.13) Rene = Es [Mia + hacia — pre] + 3 Vari [miga + hr,n-1,t+1 — pit] 


and using our guess to substitute out for hz, 1,6: 


Rent = ef Ti — Pérgr — Abu a — Bin-ı(l - ))u— 


(S11.2.14) Br ,-1¢at — B? pii + 24 — ge + 


1 
Z van | ot — Ber — Aaa — Be yi — ))u— 


B} ibe = Hii +2: — gen s 
We obtain: 
(S11.2.15) 
-hrni = Arama + Br (1— a 0°/2 |-0 + (8+ Bh Y] + 832-180. 
Thus 
Aba = Al acu zn B? „(1 = P)u m c^ [2 [-8 Gg (8 + Er] > 
Bra = B? ni 
Solving recursively and using B^, = HB, yields 
(S11.2.16) Ah, Ab = BP — 6) — o? f2 [-0° + (8+ 9"  B.'], 
Be = $” Bz. 


This completes Part 11.2.1. 
11.2.2 The log ratio of forward to futures prices is given by 


(S11.2.17) Gent — hrni = (An — Antr) + (Bn — Batr)zı + A”, + BP, nn. 


In order to show that this is constant we need to show that B, — Bay, + B^, = 0. 
Straightforward algebra gives us: 


(S11.2.18) Bn — Bnr -Bh, = Ba-Bn+r+9"B,= 
u lcu ST ic eg 
= = =0. 


Showing that the ratio of forward to future prices is greater than one is equivalent to 
showing that the log ratio is greater than zero. In order to do so we write 


Irnt—hrnt = An — An+r + An = 
An-1+(1— d)uBn-ı — (8 + Bie 429 
—An4r-i — (1 — ġ)uBn+r-1 (B + Buy+-1) 0^ /2 
Aba + (1 Ba 0/2 |-8 + (84 Bh]. 
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It can easily be checked that the terms in (1 — f)u add up to zero. Given the recursive 
nature of the problem and remembering that grızt — hrıt = 0 we have that 


(81.249) gent — Mene = —0°/2 Y [(6 + Bi)” - (0 Bitr)? - PP + (8 + e BY]. 
j=0 


Using the fact that ¢’ B, = Bj+- — Bj, after some algebra, we obtain 


_ P-E- $7") - g") 
(S11.2.20) gra — hen AA 
Thus we have that 


(S11.2.21) Grnt — hrni > 0 when 0<¢<1 
Grnt — Arent < 0 when -1<¢<0. 


The difference between a futures contract and a forward contract is that the first is 
marked to market each period during the life of the contract, so that the purchaser of 
a futures contract receives the futures price increase or pays the futures price decrease 
each period. When interest rates are random, these mark-to-market payments may be 
correlated with interest rates. When 0 < ¢ < 1, so that Cov(Arnt, yit) < 0, the purchaser 
of a futures contract tends to receive the futures price increase at times when interest rates 
are low, and tends to pay the futures price decrease at times when interest rates are high, 
making the futures contract worth less than the forward contract. On the other hand, 
when —1 < ¢ < 0, the futures contract will be worth more than the forward contract 
since its purchaser tends to receive price increases when interest rates are high (so that 
the money can be invested at a high rate of return). 

11.2.3 The parameter values for this part, $ = 0.98 and o? = 0.00051?, can be found in 
Section 11.2.2, page 453 (and not in Section 11.1.2). 


Problems in Chapter 12 


Solution 12.1 


12.1.1 There are several criteria with which random number generators can be judged: 


e Stochastic quality of apparent randomness, as reflected in the probabilistic prop- 
erties of generated sample and assessed by batteries of statistical tests of indepen- 
dence, goodness-of-fit to specific probability distributions, etc. 

e Computational efficiency, in terms of cost of implementation, resource require- 
ments, volume of output per second, volume of output in absolute terms, all with- 
out deterioration of stochastic quality. 

e Portability of the algorithm. 

e Heproducibility of random series (based on the initial “seed” of the random number 
generator). 


The ultimate introduction to the science and art of pseudorandom number generation 
is Chapter 3 of D. E. Knuth's classics The Art of Computer Programming 1969, 1981, where 
the most influential and comprehensive study of the subject is to be found. 

One example of the many recent treatises on the state of the art is Fishman (1996), 
which emphasizes pseudorandom number generators in Chapter 7. High-quality pseudo- 
random number generators also emerge in cryptography. Cryptographically secure gen- 
erators, related to stream ciphers and one-way hash functions achieve extraordinary sto- 
chastic quality, generally at the expense of increasing computation costs. See for example 
Schneier (1996, Chapters 16-18). 

There exist batteries of statistical tests intended to measure stochastic quality of pseu- 

dorandom number generators; these include tests such as chi-square, Kolmogorov-Smirnov, 
frequency, serial, gap, permutation, run, moments, serial correlation, and especially spec- 
tral tests (see Knuth [1969] for details); or, for example, an omnibus test assessing joint 
independence and one- to three-dimensional uniformity, assembled by Fishman (1996, 
Section 7.12). 
12.1.2 Generally, very well researched and tested MLCG generators constitute an accepted 
pragmatic compromise among the criteria imposed on pseudorandom number generators 
discussed in Problem 12.1.1. The proper choice of parameters of MLCG generators is 
essential, and theoretical guidelines are readily available in Knuth (1969) and elsewhere. 
The quality of the tent- and logistic-map generators is inferior for most purposes, as most 
standard statistical tests of randomness will show. 

The extra modification of using parameters like 1.99999999 instead of 2 etc. patches 
the most obvious flaw of the tent- and logistic-map generators: with real numbers rep- 
resented in binary form using finite-length mantissas, repetitive multiplication by 2 de- 
teriorates quality of the sequence rapidly, i.e., the sequence degenerates in time that is 
proportional to the mantissa length. In most practical cases, though, the use of well- 
researched pseudorandom number generators with solid theoretical guarantees of quality, 
such as MLCG, is indicated. 

If the quality of even properly chosen MLCG is not sufficient for an application at 
hand, one may consider using some other classes of well-tested generators with balanced 
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Table Estimates of kernel-regression betas of IBM relative to S&P 500 based on monthly 
return data from 1965:1 to 1994:12. Each estimate is local to a particular level of S&P 
500 monthly return. 


SP500 [%] 61BM,SP500 


—15 1.366 
—10 1.395 
—5 0.689 
0 0.666 

5 0.806 

10 0.531 
15 1.994 


TABLE 12.1. IBM Betas Relative to S&P 500 


quality-cost tradeoffs, for example, Marsaglia's lagged-Fibonacci generators (see Marsaglia 
and Zaman [1991]). 


Solution 12.2 


Equations (12.4.1) and (12.4.3) describe one unit. Our case involves ten such units with 
J =5. The output layer is given by equation (12.4.4) with K = 10. For simplicity, 
choose h(-) to be the identity, in accord with the discussion on pages 514-542. Thus, 
the nonlinear model has 60 parameters to fit. Using a nonlinear optimization technique 
of choice, find the parameter values that attain the minimum (beware of local minima!) 
in-sample root-mean-squared-error (RMSE) of the one-step-ahead estimate, with identical 
weights given to each datapoint of S&P 500 returns from 1926:1 to 1985:12. Then, apply 
the fitted perceptron parameters on data in period from 1986:1 to 1994:12. 

The RMSE will be substantially larger in the out-of sample period than in the in- 
sample period. The out-of-sample RMSE 60-parameter perceptron will probably not be 
drastically smaller than out-of-sample RMSE of a linear model with less immodest number 
of parameters (say, ten (10); consider an OLS regression with five lagged returns and 
their squares as explanatory variables), but the in-sample RMSE of the former will be 
noticeably smaller than RMSE of the latter. This phenomenon can be related to concept 
of “overfitting” which occurs when lack of structural, qualitative information of the data 
generating stochastic process is countered by increase in number of ad hoc degrees of 
freedom in the model: this procedure results in excellent in-sample fit while out-of-sample 
performance stays mediocre. 


Solution 12.3 


First implement the kernel regression estimator m» (x) according to formula (12.3.9) with 
a Gaussian kernel K,(x) as in (12.3.10). Second, determine optimal bandwidth by mini- 
mizing the cross-validation function CV (h) as in (12.3.13), based on estimator 1h; (x) and 
given historical S&P 500 and IBM monthly returns. 

Numerically, the appropriate bandwidth for period from 1965:1 to 1994:12 is h = 
1.49% (the scale is in monthly returns of S&P 500). The resulting regression is plotted 
(Figure 12.1). 

The analog of the conventional beta estimate here is the quantity Im, (x)/Ox, evalu- 
ated at particular level of S&P 500 return x. See Section (12.3.3) for a detailed discussion 
of average derivative estimators. 

Let us replace derivative by its discrete analog with a step length difference of 1% of 
the S&P 500 monthly return. The resulting estimates of ß’s for different levels of S&P 
500 returns is shown in Table 12.1. 
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Returns in Period 1965:1 - 1994:12; Kernel Regression 


20 
| 


10 


IBM Monthly Return [%] 
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T T T T 
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SP500 Monthly Return [%] 


FIGURE 12.1. Kernel Regression of IBM Returns on S&P 500 Returns 


We see that the local estimates of beta vary considerably, most likely due to the 
relatively small number of datapoints in the estimation, possible variation in beta over 
time, or genuine nonlinearity of the relation between IBM and S&P 500 monthly returns. 

Some advantages of kernel regression relative to ordinary least squares are: cross- 
validation allows for nonparametric, adaptive and asymptotically consistent estimation of 
the true relation between IBM and S&P 500 returns even when this relation is not linear; 
the kernel estimator m(x) conveys more information about the relationship than a single 
parameter (8) and allows easy visualization of the relation. 


